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HUMAN TELOMERASE CATALYTIC SUBUNIT 



The present application is a continuation-in-part application of U.S. 



5 Patent Application Serial Number 08/915,503, filed August 14, 1997, and a 

continuation-in-part application of U.S. Patent Application Serial Number 08/912,951, 
filed August 14, 1997, and a continuation-in-part application of U.S. Patent Application 
Serial Number 08/91 1,3 12, filed August 14, 1997, all three of which are continuation- 
in-part applications of U.S. Patent Application Serial Number 08/854,050, filed May 9, 

1 0 1 997, which is a continuation-in-part application of U.S. Patent Application Serial 

Number 08/851,843, filed May 6, 1997, which is a continuation-in-part application of 
U.S. Patent Application Serial Number 08/846,017, filed April 25, 1997, which is a 
continuation-in-part application of U.S. Patent Application Serial Number 08/844,419, 
filed April 19, 1996, which is a continuation-in-part application of U.S. Patent 

15 Application Serial Number 08/724,643, filed October 1, 1996. This application also 
claims priority to Patent Convention Treaty Patent Application Serial No.: 
PCT/US97/17885 and to Patent Convention Treaty Patent Application Serial No.: 
PCT/US97/17618, both filed in the U.S. Receiving Office on October 1, 1997. Each of 
the aforementioned applications is explicitly incorporated herein by reference in its 

20 entirety and for all purposes. This application also incorporates by reference copending 

U.S. Patent Application Serial Number ["Telomerase Reverse 

Transcriptase," attorney docket no. 015389-002950] filed November 19, 1997, in its 
entirety and for all purposes. 



25 GM28039, awarded by the National Institutes of Health. The Government has certain 
rights in this invention. 
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invention is directed to the catalytic subunit of human telomerase. The invention 
provides methods and compositions relating to medicine, molecular biology, chemistry, 
pharmacology, and medical diagnostic and prognostic technology. 

5 BACKGROUND OF THE INVENTION 

The following discussion is intended to introduce the field of the present 
invention to the reader. The citation of various references in this section is not to be 
construed as an admission of prior invention. 

It has long been recognized that complete replication of the ends of 

10 eukaryotic chromosomes requires specialized cell components (Watson, 1972, Nature 
New Biol, 239:197; Olovnikov, 1973, J. Theor, Biol, 41:181). Replication of a linear 
DNA strand by conventional DNA polymerases requires an RNA primer, and can 
proceed only 5 f to 3\ When the RNA bound at the extreme 5' ends of eukaryotic 
chromosomal DNA strands is removed, a gap is introduced, leading to a progressive 

1 5 shortening of daughter strands with each round of replication. This shortening of 

telomeres,, the protein-DNA structures physically located on the ends of chromosomes, 
is thought to account for the phenomenon of cellular senescence or aging (see, e.g., 
Goldstein, 1990, Science 249:1 129; Martin et al., 1979, Lab. Invest. 23:86; Goldstein et 
al., 1969, Proc. Natl Acad ScL USA 64:155; and Schneider and Mitsui, 1976, Proc. 

20 Natl Acad ScL USA, 73:3584) of normal human somatic cells in vitro and in vivo. 

The length and integrity of telomeres is thus related to entry of a cell 
into a senescent stage (i.e., loss of proliferative capacity). Moreover, the ability of a 
cell to maintain (or increase) telomere length may allow a cell to escape senescence, 
i.e., to become immortal. 

25 The structure of telomeres and telomeric DNA has been investigated in 

numerous systems (see, e.g, Hariey and Villeponteau, 1995, Curr. Opin Genet Dev. 
5:249). In most organisms, telomeric DNA consists of a tandem array of very simple 
sequences; in humans and other vertebrates telomeric DNA consists of hundreds to 
thousands of tandem repeats of the sequence TTAGGG. Methods for determining and 

30 modulating telomere length in cells are described in PCT Publications WO 93/23572 
and WO 96/41016. 
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The maintenance of telomeres is a function of a telomere-specific DNA 
polymerase known as telomerase. Telomerase is a ribonucleoprotein (RNP) that uses a 
portion of its RNA moiety as a template for telomere repeat DNA synthesis (Morin, 
1997, Eur. J. Cancer 33:750; Yu et al., 1990, Nature 344:126; Singer and Gottschling, 
5 7994 Science 266:404; Autexier and Greider, 1994, Genes Develop., 8:563; Gilley et 
al, 1995, Genes Develop., 9:2214; McEachern and Blackburn, 1995, Nature 367:403; 
Blackburn, 1992, Ann. Rev. Biochem., 61:113;. Greider, 1996, Ann. Rev. Biochem., 
65:337). The RNA components of human and other telomerases have been cloned and 
characterized (see, PCT Publication WO 96/01835 and Feng et al., 1995, Science 

10 269:1236). However, the characterization of the protein components of telomerase has 
been difficult. In part, this is because it has proved difficult to purify the telomerase 
RNP, which is present in extremely low levels in cells in which it is expressed. For 
example, it has been estimated that human cells known to express high levels of 
telomerase activity may have only about one hundred molecules of the enzyme per cell. 

1 5 Consistent with the relationship of telomeres and telomerase to the 

proliferative capacity of a cell (i.e., the ability of the cell to divide indefinitely), 
telomerase activity is detected in immortal cell lines and an extraordinarily diverse set 
of tumor tissues, but is not detected (i.e., was absent or below the assay threshold) in 
normal somatic cell cultures or normal tissues adjacent to a tumor (see, U.S. Patent 

20 Nos. 5,629,154; 5,489,508; 5,648,215; and 5,639,613; see also, Morin, 1989, Cell 59: 
521; Shay and Bacchetti 1997, Eur. J. Cancer 33:787; Kim et al., 1994, Science 
266:2011; Counter et al., 1992, EMBOJ. 11:1921; Counter et al., l994,Proc. Natl 
Acad. Sci. U.S.A. 91,2900; Counter et al., 1994, J. Virol 68:3410). Moreover, a 
correlation between the level of telomerase activity in a tumor and the likely clinical 

25 outcome of the patient has been reported (e.g., U.S. Patent No. 5,639,613, supra; 
Langford et al., 1997, Hum. Pathol 28:416). Telomerase activity has also been 
detected in human germ cells, proliferating stem or progenitor cells, and activated 
lymphocytes. In somatic stem or progenitor cells, and in activated lymphocytes, 
telomerase activity is typically either very low or only transiently expressed (see, Chiu 

30 et al., 1996, Stem Cells 14:239; Bodnar et al., 1996, Exp. Cell Res. 228:58; Taylor et 
al., 1996, J. Invest. Dermatology 106: 759). 
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Human telomerase is an ideal target for diagnosing and treating human 
diseases relating to cellular proliferation and senescence, such as cancer. Methods for 
diagnosing and treating cancer and other telomerase-related diseases in humans are 
described in U.S. Patent Nos. 5,489,508, 5,639,613, and 5,645,986. Methods for 
5 predicting tumor progression by monitoring telomerase are described in U.S. Patent No. 
5,639,613. The discovery and characterization of the catalytic protein subunit of 
human telomerase would provide additional useful assays for telomerase and for 
disease diagnosis and therapy. Moreover, cloning and determination of the primary 
sequence of the catalytic protein subunit would allow more effective therapies for 
10 human cancers and other diseases related to cell proliferative capacity and senescence. 

BRIEF SUMMARY OF THE INVENTION 

The present invention provides an isolated, substantially pure, or 
recombinant protein preparation of a telomerase reverse transcriptase protein, or a 
15 variant thereof, or a fragment thereof. In one embodiment the protein is characterized 
as having a defined motif that has an amino acid sequence: 

Trp-R r X 7 -R r R r R 2 -X-Phe-Phe^ 
where X is any amino acid and a subscript refers to the number of consecutive residues, 
Rj is leucine or isoleucine, R 2 is glutamine or arginine, R 3 is phenylalanine or tyrosine, 
20 and R4 is lysine or histidine. In one embodiment the protein has a sequence of human 
TRT. In other embodiments, the invention relates to peptides and polypeptides sharing 
substantial sequence identity with a subsequence of such proteins. 

In a related embodiment the invention provides an isolated, substantially 
pure or recombinant nucleic acid that encodes a telomerase reverse transcriptase 
25 protein. In one embodiment the nucleic acid encodes a protein comprising an amino 
acid sequence: 

Trp-R r X r R r R r R 2 -X-Phe-Phe-T^^ In 
another embodiment, the nucleic acid has a sequence that encodes the human TRT 
protein. In other embodiments, the invention relates to oligonucleotides and 
30 polynucleotides sharing substantial sequence identity or complementarity with a 
subsequence of such nucleic acids. 
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In one embodiment, the invention relates to human telomerase reverse 
transcriptase (hTRT) protein. Thus, in one embodiment, the invention provides an 
isolated, substantially pure, or recombinant protein preparation of an hTRT protein, or a 
variant thereof, or a fragment thereof. In one embodiment, the protein is characterized 
5 by having an amino acid sequence with at least about 75% or at least about 80% 
sequence identity to the hTRT protein of Figure 17 (SEQUENCE ID NO: 2), or a 
variant thereof, or a fragment thereof. In a related aspect, the hTRT protein has the 
sequence of SEQUENCE ID NO: 2. In some embodiments, the protein has one or 
more telomerase activities, such as catalytic activity. In one embodiment, the hTRT 
1 0 protein fragment has at least 6 amino acid residues. In other embodiments, the hTRT 

protein fragment has at least 8, at least about 10, at least about 12, at least about 15 or at 
least about 20 contiguous amino acid residues of a naturally occurring hTRT 
polypeptide. In still other embodiments, the hTRT protein fragment has at least about 
50 or at least about 100 amino acid residues. 
1 5 The invention also provides a composition comprising an hTRT protein 

and an RNA. The RNA may be a telomerase RNA, such as a human telomerase RNA. 
In one embodiment, the hTRT protein and the human telomerase RNA (hTR) from a 
ribonucleoprotein complex with a telomerase activity. 

In one embodiment, the invention provides isolated human telomerase 
20 comprising hTRT protein, such as a substantially pure human telomerase comprising 

hTRT protein and comprising hTR. In one embodiment, the telomerase is at least about 
95% pure. The telomerase may be isolated from a cell, such as a recombinant host cell 
in or a cell that expresses telomerase activity. 

In another aspect, the invention provides an isolated, synthetic, 
25 substantially pure, or recombinant polynucleotide comprising a nucleic acid sequence 
that encodes an hTRT protein. In one embodiment, the polynucleotide has a nucleotide 
sequence encoding an hTRT protein that has an amino acid sequence as set forth in 
Figure 17 (SEQUENCE ID NO:2) or a sequence that comprises one or more 
conservative amino acid (or codon) substitutions or one or more activity-altering amino 
30 acid (or condon) substitutions in said amino acid sequence. In a related aspect, the 
polynucleotide hybridizes under stringent conditions to a polynucleotide having the 
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sequence as set forth in in Figure 16 (SEQUENCE ID N0:1). In another related aspect, 
the nucleotide sequence of the polynucleotide has a smallest sum probability of less 
than about 0.5 when compared to a nucleotide sequence as set forth in Figure 16 
(SEQUENCE ID NO:l) using BLAST algorithm with default parameters. 

In another aspect, the invention provides a polynucleotide having a 
promoter sequence operably linked to the sequence encoding the hTRT protein. The 
promoter may be a promoter other than the naturally occurring hTRT promoter. In a 
related aspect, the invention provides an expression vector comprising the promoter of 
the hTRT. 

The invention also provides an isolated, synthetic, substantially pure, or 
recombinant polynucleotide that is at least ten nucleotides in length and comprises a 
contiguous sequence of at least ten nucleotides that is identical or exactly 
complementary to a contiguous sequence in a naturally occurring hTRT gene or hTRT 
mRNA. In some embodiments the polynucleotide is an RNA, a DNA, or contains one 
or more non-naturally occurring, synthetic nucleotides. In one aspect, the 
polynucleotide is identical or exactly complementary to the contiguous sequence of at 
least ten contiguous nucleotides in a naturally occurring hTRT gene or hTRT mRNA. 
For example, the polynucleotide may be an antisense polynucleotide. In one 
embodiment, the antisense polynucleotide comprises at least about 20 nucleotides. 

The invention further provides a method of preparing recombinant 
telomerase by contacting a recombinant hTRT protein with a telomerase RNA 
component under conditions such that said recombinant protein and said telomerase 
RNA component associate to form a telomerase enzyme capable of catalyzing the 
addition of nucleotides to a telomerase substrate. In one embodiment, the hTRT protein 
has a sequence as set forth in Figure 17 (SEQUENCE ID NO:2). The hTRT protein 
may be produced in an in vitro expression system and mixed with a telomerase RNA 
or, in another embodiment, the telomerase RNA can be co-expressed in the in vitro 
expression system. In one embodiment the telomerase RNA is hTR. In an alternative 
embodiment, the contacting occurs in a cell, such as a human cell. In one embodiment, 
the cell does not have telomerase activity prior to the contacting of the hTRT and the 
RNA, or the introduction, such as by transfection, of an hTRT polynucleotide. In one 



embodiment, the telomerase RNA is expressed naturally by said cell. 

The invention also provides a cell, such as a human, mouse, or yeast 
cell, containing the recombinant polynucleotides of the invention such as a 
polynucleotide with an hTRT protein coding sequence operably linked a promoter. In 

5 particular aspects, the cell is a vertebrate cell, such as a cell from a mammal, for 

example a human, and has an increased proliferative capacity relative to a cell that is 
otherwise identical but does not comprise the recombinant polynucleotide or has an 
increased telomerase activity level relative to a cell that is otherwise identical but does 
not comprise the recombinant polynucleotide. In some embodiments the cell is 

10 immortal. 

In related embodiments, the invention provides organisms and cells 
comprising a polynucleotide encoding a human telomerase reverse transcriptase 
polypeptide, such as a transgenic non-human organism such as a yeast, plant, 
bacterium, or a non-human animal, for example, a mouse. The invention also provides 

1 5 for transgenic animals and cells from which an hTRT gene has been deleted (knocked- 
out) or mutated such that the gene does not express a naturally occurring hTRT gene 
product. Thus, in alternative embodiments, the transgenic non-human animal has a 
mutated telomerase gene, is an animal deficient in a telomerase activity, is an animal 
whose TRT deficiency is a result of a mutated gene encoding a TRT having a reduced 

20 level of a telomerase activity compared to a wild-type TRT and is an animal having a 
mutated TRT gene with one or more mutations, including missense mutations, 
nonsense mutations, insertions, or deletions. 

The invention also provides an isolated or recombinant antibody, or 
fragment thereof, that specifically binds to an hTRT protein. In one embodiment, the 

25 antibody binds with an affinity of at least about 10 8 M" 1 . The antibody may be 

monoclonal or may be a polyclonal composition, such as a polyclonal antisera. In a 
related aspect, the invention provides a cell capable of secreting the antibody, such as a 
hybridoma. 

The invention also provides a method for determining whether a 
30 compound or treatment is a modulator of a telomerase reverse transcriptase activity or 
hTRT expression. The method involves detecting or monitoring a change in activity or 
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expression in a cell, animal or composition comprising an hTRT protein or 
polynucleotide following administration of the compound or treatment. In one 
embodiment, the method includes the steps of: providing a TRT composition, 
contacting the TRT with the test compound and measuring the activity of the TRT, 
where a change in TRT activity in the presence of the test compound is an indicator 
that the test compound modulates TRT activity. In certain embodiments, the 
composition is a cell, an organism, a transgenic organism or an in vitro system, such as 
an expression system, which contains a recombinant polynucleotide encoding an hTRT 
polypeptide. Thus, the hTRT of the method may be a product of in vitro expression. In 
various embodiments the detection of telomerase activity or expression may be by 
detecting a change in abundance of an hTRT gene product, monitoring incorporation of 
a nucleotide label into a substrate for telomerase, monitoring hybridization of a probe to 
an extended telomerase substrate, monitoring amplification of an extended telomerase 
substrate, monitoring telomere length of a cell exposed to the test compound, 
monitoring the loss of the ability of the telomerase to bind to a chromosome, or 
measuring the accumulation or loss of telomere structure. 

In one aspect, the invention provides a method of detecting an hTRT 
gene product in a biological sample by contacting the biological sample with a probe 
that specifically binds the gene product, wherein the probe and the gene product form a 
complex, and detecting the complex, where the presence of the complex is correlated 
with the presence of the hTRT gene product in the biological sample. The gene product 
may be RNA, DNA or a polypeptide. Examples of probes that may be used for 
detection include, but are not limited to, nucleic acids and antibodies. 

In one embodiment, the gene product is a nucleic acid which is detected 
by amplifying the gene and detecting the amplification product, where the presence of 
the complex or amplification product is correlated with the presence of the hTRT gene 
product in the biological sample. 

In one embodiment, the biological sample is from a patient, such as a 
human patient. In another embodiment the biological sample includes at least one cell 
from an in vitro cell culture, such as a human cell culture. 

The invention further provides a method of detecting the presence of at 
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least one immortal or telomerase positive human cell in a biological sample comprising 
human cells by obtaining the biological sample comprising human cells; and detecting 
the presence in the sample of a cell having a high level of an hTRT gene product, where 
the presence of a cell having a high level of the hTRT gene product is correlated with 
5 the presence of immortal or telomerase positive cells in the biological sample. 

The invention also provides a method for diagnosing a telomerase- 
related condition in a patient by obtaining a cell or tissue sample from the patient, 
determining the amount of an hTRT gene product in the cell or tissue; and comparing 
the amount of hTRT gene product in the cell or tissue with the amount in a healthy cell 
1 0 or tissue of the same type, where a different amount of hTRT gene product in the 
sample from the patient and the healthy cell or tissue is diagnostic of a telomerase- 
related condition. In one embodiment the telomerase-related condition is cancer and a 
greater amount of hTRT gene product is detected in the sample. 

The invention further provides a method of diagnosing cancer in a 
1 5 patient by obtaining a biological sample from the patient, and detecting a hTRT gene 
product in the patient sample, where the detection of the hTRT gene product in the 
sample is correlated with a diagnosis of cancer. 

The invention further provides a method of diagnosing cancer in a 
patient by obtaining a patient sample, determining the amount of hTRT gene product in 
20 the patient sample; and comparing the amount of hTRT gene product with a normal or 
control value, where an amount of the hTRT gene product in the patient that is greater 
than the normal or control value is diagnostic of cancer. 

The invention also provides a method of diagnosing cancer in a patient, 
by obtaining a patient sample containing at least one cell; determining the amount of 
25 an hTRT gene product in a cell in the sample; and comparing the amount of hTRT gene 
product in the cell with a normal value for the cell, wherein an amount of the hTRT 
gene product greater than the normal value is diagnostic of cancer. In one embodiment, 
the sample is believed to contain at least one malignant cell. 

The invention also provides a method for a prognosing a cancer patient 
30 by determining the amount of hTRT gene product in a cancer cell obtained from the 
patient; and comparing the amount of hTRT in the cancer cell with a prognostic value 
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of hTRT consistent with a prognosis for the cancer; where an amount of hTRT in the 
sample that is at the prognostic value provides the particular prognosis. 

The invention also provides a method for monitoring the ability of an 
anticancer treatment to reduce the proliferative capacity of cancer cells in a patient, by 
5 making a first measurement of the amount of an hTRT gene product in at least one 
cancer cell from the patient; making a second measurement of the level of the hTRT 
gene product in at least one cancer cell from the patient, wherein the anticancer 
treatment is administered to the patient before the second measurement; and comparing 
the first and second measurements, where a lower level of the hTRT gene product in the 

10 second measurement is correlated with the ability of an anticancer treatment to reduce 
the proliferative capacity of cancer cells in the patient. 

The invention also provides kits for the detection of an hTRT gene or 
gene product. In one embodiment, the kit includes a container including a molecule 
selected from an hTRT nucleic acid or subsequence thereof, an hTRT polypeptide or 

1 5 subsequence thereof, and an anti-hTRT antibody. 

The invention also provides methods of treating human diseases. In one 
embodiment, the invention provides a method for increasing the proliferative capacity 
of a vertebrate cell, such as a mammalian cell, by introducing a recombinant 
polynucleotide into the cell, wherein said polynucleotide comprises a sequence 

20 encoding an hTRT polypeptide. In one embodiment, the hTRT polypeptide has a 

sequence as shown in Figure 17. In one embodiment, the sequence is operably linked 
to a promoter. In one embodiment, the hTRT has telomerase catalytic activity. In one 
embodiment, the cell is human, such as a cell in a human patient. In an alternative 
embodiment, the cell is cultured in vitro. In a related embodiment, the cell is 

25 introduced into a human patient. 

The invention further provides a method for treating a human disease by 
introducing recombinant hTRT polynucleotide into at least one cell in a patient. In one 
embodiment, a gene therapy vector is used. In a related embodiment, the method 
further consists of introducing into the cell a polynucleotide comprising a sequence 

30 encoding hTR, for example, an hTR polynucleotide operably linked to a promoter. 

The invention also provides a method for increasing the proliferative 
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capacity of a vertebrate cell, said method comprising introducing into the cell an 
effective amount of hTRT polypeptide. In one embodiment the hTRT polypeptide has 
telomerase catalytic activity. The invention further provides cells and cell progeny with 
increased proliferative capacity. 

The invention also provides a method for treating a condition associated 
with an elevated level of telomerase activity within a cell, comprising introducing into 
said cell a therapeutically effective amount of an inhibitor of said telomerase activity, 
wherein said inhibitor is an hTRT polypeptide or an hTRT polynucleotide. In one 
embodiment, the inhibitor is a polypeptide or polynucleotide comprising, e.g., at least a 
subsequence of a sequence shown in Figures 16, 17, or 20. In additional embodiments, 
the polypeptide or polynucleotide inhibits a TRT activity, such as binding of 
endogenous TRT to telomerase RNA. 

The invention also provides a vaccine comprising an hTRT polypeptide 
and an adjuvant. The invention also provides pharmacological compositions containing 
a pharmaceutical^ acceptable carrier and a molecule selected from: an hTRT 
polypeptide, a polynucleotide encoding an hTRT polypeptide, and an hTRT nucleic 
acid or subsequence thereof. 

DESCRIPTION OF THE FIGURES 

Figure 1 shows highly conserved residues in TRT motifs from human, S. 
pombe (tezl), S. cerevisiae (EST2) and Euplotes aediculatus (pi 23). Identical amino 
acids are indicated with an asterisk (*) [raised slightly], while the similar amino acid 
residues are indicated by a dot (•). Motif "0" in the figure is also called Motif T; Motif 
"3" is also called Motif A. 

Figure 2 shows the location of telomerase-specific and RT-specific 
sequence motifs of telomerase proteins and other reverse transcriptases. Locations of 
telomerase-specific motif T and conserved RT motifs 1, 2 and A-E are indicated by 
boxes. The open rectangle labeled HIV-1 RT delineates the portion of this protein 

shown in Figure 3. 

Figure 3 shows the crystal structure of the p66 subunit of HIV-1 reverse 
transcriptase (Brookhaven code 1HNV). The view is from the back of the right hand to 
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enable all motifs to be shown. 

Figure 4 shows multiple sequence alignment of telomerase RTs 
(SpJTrtlp, S. pombe TRT [also referred to herein as "tezlp"]; hTRT, human TRT; 
Ea__pl23, Euplotes pi 23; Sc_Est2p, S. cerevisiae Est2p) and members of other RT 
5 families (Sc_al, cytochrome oxidase group II intron 1 -encoded protein from S. 
cerevisiae mitochondria, DmJTART, reverse transcriptase from Drosophila 
melanogaster TART non-LTR retrotransposable element); HIV-1, human 
immunodeficiency virus reverse transcriptase). TRT con and RT con represent 
consensus sequences for telomerase RTs and non-telomerase RTs. Amino acids are 
10 designated with an h, hydrophobic; p, polar; c, charged. Triangles show residues that 
are conserved among telomerase proteins but different in other RTs. The solid line 
below motif E highlights the primer grip region. 

Figure 5 shows expression of hTRT RNA in telomerase-negative mortal 
cell strains and telomerase-positive immortal cell lines as described in Example 2. 
15 Figure 6 shows a possible phylogenetic tree of telomerases and 

retroelements rooted with RNA-dependent RNA polymerases. 

Figure 7 shows a restriction map of lambda clone Gcj>5. 

Figure 8 shows a map of chromosome 5p with the location of the STS 
marker D5S678 (located near the hTRT gene) indicated. 
20 Figure 9 shows the construction of a hTRT promoter-reporter plasmid. 

Figure 10, in two pages, shows coexpression in vitro of hTRT and hTR 
to produce catalytically active human telomerase. 

Figure 1 1, in two pages, shows an alignment of sequences from four 
TRT protein and identifies motifs of interest. TRT con shows a TRT consensus 
25 sequence. RT con shows consensus residues for other reverse transcriptases. 

Consensus residues in upper case indicate absolute conservation in TRT proteins. 

Figure 12 shows a Topoisomerase II cleavage site and NFkB binding 
site motifs in an hTRT intron, with the sequence shown corresponding to SEQUENCE 
IDNO:7. 

30 Figure 13, in two pages, shows the sequence of the DNA encoding the 

Euplotes 123 kDa telomerase protein subunit (Euplotes TRT). 

12 
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Figure 14 shows the amino acid sequence of the Euplotes 123 kDa 
telomerase protein subunit {Euplotes TRT protein). 

Figure 15, in five pages, shows the DNA and amino acid sequences of 
the S. pombe telomerase catalytic subunit (& pombe TRT). 
5 Figure 16, in two pages, shows the hTRT cDNA sequence, with the 

sequence shown corresponding to SEQUENCE ID NO: 1. 

Figure 17 shows the hTRT protein encoded by the cDNA of Figure 16. 
The protein sequence shown corresponds to SEQUENCE ID NO: 2. 

Figure 18 shows the sequence of clone 712562, with the sequence 
10 shown corresponding to SEQUENCE ID NO: 3. 

Figure 19 shows a 259 residue protein encoded by clone 712562, with 
the sequence shown corresponding to SEQUENCE ID NO: 10. 

Figure 20 shows, in seven pages, the sequence of a nucleic acid with an 
open reading frame encoding a A 182 variant polypeptide, with the sequence shown 
15 corresponding to SEQUENCE ID NO: 4. This Figure also shows the amino acid 
sequence of this A 182 variant polypeptide, with the amino acid sequence shown 
corresponding to SEQUENCE ID NO: 5. 

Figure 21 shows, in six pages, sequence from an hTRT genomic clone, 
with the sequence shown corresponding to SEQUENCE ID NO: 6. Consensus motifs 
20 and elements are indicated, including sequences characteristic of a topoisomerase II 
cleavage site, NFkB binding sites, an Alu sequence and other sequence elements. 

Figure 22 shows the effect of mutation of the TRT gene in yeast, as 
described in Example 1. 

Figure 23 shows the sequence of EST AA281296, corresponding to 
25 SEQUENCE ID NO: 8. 

Figure 24 shows the sequence of the 1 82 basepairs deleted in clone 
712562, with the sequence shown corresponding to SEQUENCE ID NO: 9. 

Figure 25 shows the results of an assay for telomerase activity from BJ 
cells transfected with an expression vector encoding an hTRT protein (pGRN133) or a 
30 control plasmid (pBBS212) as described in Example 13. 

Figure 26 is a schematic diagram of the affinity purification of 
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telomerase showing the binding and displacement elution steps. 

Figure 27 is a photograph of a Northern blot of telomerase preparations 
obtained during a purification protocol, as described in Example 1. Lane 1 contained 
L5 fmol telomerase RNA, lane 2 contained 4.6 fmol telomerase RNA, lane 3 contained 
14 fmol telomerase RNA, lane 4 contained 41 fmol telomerase RNA, lane 5 contained 
nuclear extract (42 fmol telomerase), lane 6 contained Affi-Gel-heparin-purified 
telomerase (47 fmol telomerase), lane 7 contained affinity-purified telomerase (68 
fmol), and lane 8 contained glycerol gradient-purified telomerase (35 fmol). 

Figure 28 shows telomerase activity through a purification protocol. 

Figure 29 is a photograph of a SDS-PAGE gel, showing the presence of 
an approximately 123 kDa polypeptide and an approximately 43 kDa doublet from 

Euplotes aediculatus. 

Figure 30 is a graph showing the sedimentation coefficient of Euplotes 

aediculatus telomerase. 

Figure 31 is a photograph of a polyacrylamide/urea gel with 36% 
formamide showing the substrate utilization of Euplotes telomerase. 

Figure 32 shows the putative alignments of telomerase RNA template, 
and hairpin primers with telomerase RNA. 

Figure 33 is a photograph of lanes 25-30 of the gel shown in Figure 3 1 , 
shown at a lighter exposure level. 

Figure 34 shows the DNA sequence of the gene encoding the 43 kDa 
telomerase protein subunit from Euplotes. 

Figure 35 shows, in four pages, the DNA sequence, as well as the amino 
acid sequences of all three open reading frames of the 43 kDa telomerase protein 
subunit from Euplotes. 

Figure 36 shows a sequence comparison between the 123 kDa 
telomerase protein subunit of Euplotes (upper sequence) and the 80 kDa polypeptide 
subunit of T. thermophila (lower sequence). 

Figure 37 shows a sequence comparison between the 123 kDa 
telomerase protein subunit of E. aediculatus (upper sequence) and the 95 kDa 
telomerase polypeptide of T. thermophila (lower sequence). 
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Figure 38 shows the best-fit alignment between a portion of the "La- 
domain" of the 43 kDa telomerase protein subunit of E. aediculatus (upper sequence) 
and a portion of the 95 kDa polypeptide subunit of T. thermophila (lower sequence). 

Figure 39 shows the best-fit alignment between a portion of the "La- 
5 domain" of the 43 kDa telomerase protein subunit of E. aediculatus (upper sequence) 
and a portion of the 80 kDa polypeptide subunit of T, thermophila (lower sequence). 

Figure 40 shows the alignment and motifs of the polymerase domain of 
the 123 kDa telomerase protein subunit of E. aediculatus and the polymerase domains 
of various reverse transcriptases including a cytochrome oxidase group II intron 
10 1 -encoded protein from S. cerevisiae mitochondria (al S.c. (group II)), Dong (LINE), 
and yeast ESTp (L8543.12). 

Figure 41 shows the alignment of a domain of the 43 kDa telomerase 
protein subunit with various La proteins. 

Figure 42 shows the nucleotide sequence encoding the T. thermophila 

1 5 80 kDa protein subunit. 

Figure 43 shows the amino acid sequence of the T. thermophila 80 kDa 

protein subunit. 

Figure 44 shows the nucleotide sequence encoding the T. thermophila 
95 kDa protein subunit. 
20 Figure 45 shows the amino acid sequence of the T. thermophila 95 kDa 

protein subunit. 

Figure 46 shows the amino acid sequence of L8543.12 ( ,! Est2p n ). 

Figure 47 shows the alignment of the amino acid sequence encoded by 
the Oxytricha PCR product with the Euplotes pi 23 sequence. 
25 Figure 48 shows the DNA sequence of Est2. 

Figure 49 shows partial amino acid sequence from a cDNA clone 
encoding human telomerase peptide motifs. 

Figure 50 shows partial DNA sequence of a cDNA clone encoding 
human telomerase peptide motifs. 
30 Figure 51 shows the amino acid sequence of tezl, also called S. pombe 

trt. 
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Figure 52 shows, in two pages, the DNA sequence of tezL Intronic and 

other non-coding regions are shown in lower case and exons (i.e., coding regions) are 

shown in upper case. 

Figure 53 shows the alignment of EST2p, Euplotes, and Tetrahymena 

5 sequences, as well as consensus sequence. 

Figure 54 shows the sequences of peptides useful for production of anti- 

hTRT antibodies. 

Figure 55 is a schematic summary of the tez/ + sequencing experiments. 

Figure 56 shows two degenerate primers used in PCR to identify the S. 
10 pombe homolog of the E. aediculatus pi 23 sequences. 

Figure 57 shows the four major bands produced in PCR using 
degenerate primers to identify the S. pombe homolog of the E. aediculatus pi 23 
sequences. 

Figure 58 shows the alignment of the M2 PCR product with E. 
1 5 aediculatus pl23, S. cerevisiae, and Oxytricha telomerase protein sequences. 

Figure 59 is a schematic showing the 3 ! RT PCR strategy for identifying 
the S. pombe homolog of the E. aediculatus pi 23. 

Figure 60 shows characteristics of the libraries used to screen for S. 
pombe telomerase protein sequences and shows the results of screening the libraries for 
20 S. pombe telomerase protein sequences. 

Figure 61 shows the positive results obtained with the /fmdIII-digested 
positive genomic clones containing S. pombe telomerase sequence. 

Figure 62 is a schematic showing the 5* RT PCR strategy used to obtain 
a full length S. pombe TRT clone. 
25 Figure 63 shows the alignment of RT domains from telomerase catalytic 

subunits for S. pombe (S.p.), S. cerevisiae (S.c.) and E. aediculatus (E.a.). 

Figure 64 shows the alignment of the sequences from Euplotes 
("Ea _pl23")> S. cerevisiae ("ScEst2p"), and S. pombe ("SpJTezlp"). In Panel A, the 
shaded areas indicate residues shared between two sequences. In Panel B, the shaded 
30 areas indicate residues shared between all three sequences. 

Figure 65 shows the disruption strategy used with the telomerase genes 



16 



in S. pombe. 

Figure 66 shows the experimental results confirming disruption of tezL 
Figure 67 shows the progressive shortening of telomeres in S. pombe 
due to tezl disruption. 

5 Figure 68 shows ? in four pages, the DNA and amino acid of the ORF 

encoding an approximately 63 kDa telomerase protein encoded by the EcoRI-NotI 
insert of clone 712562. 

Figure 69 shows an alignment of reverse transcriptase motifs from 
various sources. 

1 0 Figure 70 provides a restriction and function map of plasmid pGRN 121. 

Figure 71 shows, in two pages, the results of preliminary nucleic acid 
sequencing analysis of a hTRT cDNA sequence. 

Figure 72 shows, in ten pages, the preliminary nucleic acid sequence of 
hTRT and deduced ORF sequences in three reading frames. 
1 5 Figure 73 provides a restriction and function map of plasmid pGRN 121. 

Figure 74 shows, in eight pages, refined nucleic acid sequence and 
deduced ORF sequences of hTRT 

Figure 75 shows a restriction map of lambda clone 25-1.1. 
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DETAILED DESCRIPTION OF THE INVENTION 
I. INTRODUCTION 

Telomerase is a ribonucleoprotein complex (RNP) comprising an RNA 
component and a catalytic protein component. The present invention relates to the 
5 cloning and characterization of the catalytic protein component of telomerase, 

hereinafter referred to as "TRT" (telomerase reverse transcriptase). TRT is so named 
because this protein acts as an RNA-dependent DNA polymerase (reverse 
transcriptase), using the telomerase RNA component (hereinafter, "TR") to direct 
synthesis of telomere DNA repeat sequences. Moreover, TRT is evolutionarily related 

10 to other reverse transcriptases (see Example 12). 

In one aspect, the present invention relates to the cloning and 
characterization of the catalytic protein component of human telomerase, hereinafter 
referred to as "hTRT." Human TRT is of extraordinary interest and value because, as 
noted supra, telomerase activity in human (and other mammalian cells) correlates with 

1 5 cell proliferative capacity, cell immortality, and the development of a neoplastic 

phenotype. For example, telomerase activity, and, as demonstrated in Example 2, infra, 
levels of human TRT gene products and are elevated in immortal human cells (such as 
malignant tumor cells and immortal cell lines) relative to mortal cells (such as most 
human somatic cells). 

20 The present invention further provides methods and compositions 

valuable for diagnosis, prognosis, and treatment of human diseases and disease 
conditions, as described in some detail infra. Also provided are methods and reagents 
useful for immortalizing cells (in vivo and ex vivo), producing transgenic animals with 
desirable characteristics, and numerous other uses, many of which are described infra, 

25 The invention also provides methods and reagents useful for preparing, cloning, or re- 
cloning TRT genes and proteins from ciliates, fungi, vertebrates, such as mammals, and 
other organisms. 

As described in detail infra, TRT was initially characterized following 
purification of telomerase from the ciliate Euplotes aediculatus. Extensive purification 
30 of E. aediculatus telomerase, using RNA-affinity chromatography and other methods, 
yielded the protein "pi 23". Surprisingly, pi 23 is unrelated to proteins previously 
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believed to constitute the protein subunits of the telomerase holoenzyme (i.e., the p80 
and p95 proteins of Tetrahymena thermophila). Analysis of the pi 23 DNA and protein 
sequences (Genbank Accession No. U95964; Figures 13 and 14) revealed reverse 
transcriptase (RT) motifs consistent with the role of pi 23 as the catalytic subunit of 

5 telomerase (see, e.g., Figures 1, 4 and 11). Moreover, pi 23 is related to a S. cerevisiae 
(yeast) protein, Est2p, which was known to play a role in maintenance of telomeres in 
S. cerevisiae (Genbank Accession No. S5396), but prior to the present invention was 
not recognized as encoding a telomerase catalytic subunit protein (see, e.g., Lendvay et 
al, 1996, Genetics, 144:1399). 

10 In one aspect, the present invention provides reagents and methods for 

identifying and cloning novel TRTs using: nucleic acid probes and primers generated or 
derived from the TRT polynucleotides disclosed (e.g., for cloning TRT genes and 
cDNAs); antibodies that specifically recognize the motifs or motif sequences or other 
TRT epitopes (e.g., for expression cloning TRT genes or purification of TRT proteins); 

15 by screening computer databases; or other means. For example, as described in 

Example 1, PCR (polymerase chain reaction) amplification of S. pombe DNA was 
carried out with degenerate-sequence primers designed from the Euplotes pi 23 RT 
motifs B' and C. Of four prominent products generated, one encoded a peptide 
sequence homologous to Euplotes pi 23 and S. cerevisiae Est2p. Using this PCR 

20 product as a probe, the complete sequence of the S. pombe TRT homologue was 
obtained by screening of S. pombe cDNA and genomic libraries and amplifying S. 
pombe RNA by reverse transcription and PCR (RT-PCR). The complete sequence of 
the S. pombe gene ("trtl"; GenBank Accession No. AF015783; Figure 15) revealed that 
homology with pi 23 and Est2p was especially high in the reverse transcriptase motifs. 

25 S. pombe trtl is also referred to as tezl . 

Amplification using degenerate primers derived from the telomerase RT 
motifs was also used to obtain TRT gene sequences in Oxytricha trifallax and 
Tetrahymena thermophila, as described in Example 1. 

The Euplotes pi 23, S. pombe trtl, and S. cerevisiae Est2p nucleic acid 

30 sequences of the invention were used in a search of a computerized database of human 
expressed sequence tags (ESTs) using the program BLAST (Altschul et al, 1990, J. 
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Mol Biol 215:403). Searching this database with the Est2p sequence did not indicate a 
match, but searching with pl23 and trtl sequences identified a human EST (Genbank 
accession no. AA281296; see SEQUENCE ID NO: 8), as described in Example 1, 
putatively encoding a homologous protein. Complete sequencing of the cDNA clone 
5 containing the EST (hereinafter, "clone 712562"; see SEQUENCE ID NO: 3) showed 
that seven RT motifs were present. However, this clone did not encode a contiguous 
human TRT with all seven motifs, because motifs B\ C, D, and E were contained in a 
different open reading frame (ORF) than the more NH 2 -terminal motifs. In addition, 
the distance between motifs A and B f was substantially shorter than that of the three 
10 previously characterized TRTs. Clone 712562 was obtained from the I.M.A.G.E. 
Consortium; Lennon et al., 1996, Genomics 33:151. 

A cDNA clone, pGRN121, encoding a functional hTRT (see Figure 16, 
SEQUENCE ID NO: 1) was isolated from a cDNA library derived from the human 293 
cell line as described in Example 1. Comparing clone 712562 with pGRN121 showed 
1 5 that clone 712562 has a 1 82 base pair (see Figure 24, SEQUENCE ID NO: 9) deletion 
between motifs A and B\ The additional 182 base pairs present in pGRN121 place all 
of the TRT motifs in a single open reading frame, and increase the spacing between the 
motif A and motif B* regions to a distance consistent with the other known TRTs. As is 
described infra in the Examples (e.g., Example 7), SEQUENCE ID NO: 1 encodes a 
20 catalytically active telomerase protein having the sequence of SEQUENCE ID NO: 2. 
The polypeptide of SEQUENCE ID NO: 2 has 1 132 residues and a calculated 
molecular weight of about 127 kilodaltons (kD). 

As is discussed infra, and described in Example 9, infra, TRT cDNAs 
possessing the 182 basepair deletion characteristic of the clone 712562 are detected 
25 following reverse transcription of RNA from telomerase-positive cells (e.g., testis and 
293 cells). hTRT RNAs lacking this 182 base pair sequence are referred to generally as 
"A 182 variants" and may represent one, two, or several species. Although the hTRT 
variants lacking the 182 basepair sequence found in the pGRN121 cDNA are unlikely 
to encode a fully active telomerase catalytic enzyme, they may play a role in telomerase 
30 regulation, as discussed infra, and/or have partial telomerase activity, such as telomere 
binding or hTR binding activity, as discussed infra. 
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Thus, in one aspect, the present invention provides an isolated 
polynucleotide with a sequence of a naturally occurring human TRT gene or mRNA 
including, but not limited to, a polynucleotide having the sequence as set forth in Figure 
16 (SEQUENCE ID NO: 1). In a related aspect, the invention provides a 

5 polynucleotide encoding an hTRT protein, fragment, variant or derivative. In another 
related aspect, the invention provides sense and antisense nucleic acids that bind to an 
hTRT gene or mRNA. The invention further provides hTRT proteins, whether 
synthesized or purified from natural sources, as well as antibodies and other agents that 
specifically bind an hTRT protein or a fragment thereof The present invention also 

10 provides many novel methods, including methods that employ the aforementioned 
compositions, for example, by providing diagnostic and prognostic assays for human 
diseases, methods for developing therapeutics and methods of therapy, identification of 
telomerase-associated proteins, and methods for screening for agents capable of 
activating or inhibiting telomerase activity. Numerous other aspects and embodiments 

1 5 of the invention are provided infra. 

One aspect of the invention is the use of a polynucleotide that is at 
least ten nucleotides to about 10 kb or more in length and comprises a contiguous 
sequence of at least ten nucleotides that is identical or exactly complementary to a 
contiguous sequence in a naturally occurring hTRT gene or hTRT mRNA in assaying 

20 or screening for an hTRT gene sequence or hTRT mRNA, or in preparing a 
recombinant host cell. 

A further aspect of the invention is the use of an agent increasing 
expression of hTRT in the manufacture of a medicament for the treatment of a 
condition addressed by increasing proliferative capacity of a vertebrate cell, 

25 optionally the medicament being for inhibiting the effects of aging. 

Yet a further aspect of the invention is the use of an inhibitor of 
telomerase activity in the manufacture of a medicament for the treatment of a 
condition associated with an elevated level of telomerase activity within a human cell. 
The proteins, variants and fragments of the invention, and the 

30 encoding polynucleotides or fragments, are also each provided in a further aspect of 
this invention for use as a pharmaceutical. 
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The invention further includes the use of a protein, variant or 
fragment, or of a polynucleotide or fragment, in each case as defined herein, in the 
manufacture of a medicament, for example in the manufacture of a medicament for 
inhibiting an effect of aging or cancer. 
5 Another aspect of the invention is a polynucleotide selected from: 

(a) the DNA having a sequence as set forth in Figure 16; 

(b) a polynucleotide of at least 10 nucleotides which hybridizes to the 
foregoing DNA and which codes for an hTRT protein or variant or which hybridizes to 
a coding sequence for such a variant; and, 

1 o (c) DNA sequences which are degenerate as a result of the genetic code 

to the DNA sequences defined in (a) and (b) and which code for an hTRT polypeptide 
or variant. 

In certain embodiments of the present invention, the hTRT 
polynucleotides are other than the 389 nucleotide polynucleotide of SEQUENCE ID 

15 NO:8 and/or other than clone 712562, the plasmid containing an insert, the sequence of 
which insert is shown in Figure 18 (SEQUENCE ID NO:3). 

The description below is organized by topic. Part II further describes 
amino acid motifs characteristic of TRT proteins, as well as TRT genes encoding 
proteins having such motifs. Parts III-VI describe, inter alia, nucleic acids, proteins, 

20 antibodies and purified compositions of the invention with particular focus on human 
TRT related compositions. Part VII describes, inter alia, methods and compositions of 
the invention useful for treatment of human disease. Part VIII describes production and 
identification of immortalized human cell lines. Part IX describes, inter alia, uses of 
the nucleic acids, polynucleotides, and other compositions of the invention for 

25 diagnosis of human diseases. Part X describes, inter alia, methods and compositions of 
the invention useful for screening and identifying agents and treatments that modulate 
(e.g., inhibit or promote) telomerase activity or expression. Part XI describes, inter 
alia, transgenic animals (e.g., telomerase knockout animals and cells). Part XII is a 
glossary of terms used in Parts I-XI. Part XIII describes examples relating to specific 

30 embodiments of the invention. The organization of the description of the invention by 
topic and subtopic is to provide clarity, and not to be limiting in any way. 
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II. TRT GENES AND PROTEINS 

The present invention provides isolated and/or recombinant genes and 
proteins having a sequence of a telomerase catalytic subunit protein (i.e., telomerase 
reverse transcriptase), including, but not limited to, the naturally occurring forms of 
5 such genes and proteins in isolated or recombinant form. Typically, TRTs are large, 
basic, proteins having reverse transcriptase (RT) and telomerasespecifw (T) amino 
acid motifs, as disclosed herein. Because these motifs are conserved across diverse 
organisms, TRT genes of numerous organisms may be obtained using the methods of 
the invention or identified using primers, nucleic acid probes, and antibodies of the 

1 0 invention, such as those specific for one or more of the motif sequences. 

The seven RT motifs found in TRTs, while similar to those found in 
other reverse transcriptases, have particular hallmarks. For example, as shown in 
Figure 4, within the TRT RT motifs there are a number of amino acid substitutions 
(marked with arrows) in residues highly conserved among the other RTs. For example, 

15 in motif C the two aspartic acid residues (DD) that coordinate active site metal ions 
(see, Kohlstaedt et al., 1992, Science 256:1783; Jacobo-Molina et al, 1993, Proc. 
Natl Acad Set U.S.A. 90:6320; Patel et al., 1995, Biochemistry 34:5351) occur in the 
context hxDD(F/Y) in the telomerase RTs compared to (F/Y)xDDh in the other RTs 
(where "h" is a hydrophobic amino acid, and "x" is any amino acid; see Xiong et al., 

20 1990, EMBOJ. 9:3353; Eickbush, in The Evolutionary Biology of Viruses, (S. Morse, 
Ed., Raven Press, NY, p. 121, 1994)). Another systematic change characteristic of the 
telomerase subgroup occurs in motif E, where WxGxSx is a consensus sequence or is 
conserved among the telomerase proteins, whereas hLGxxh is characteristic of other 
RTs (Xiong et al., supra; Eickbush supra). This motif E is called the "primer grip", and 

25 mutations in this region have been reported to affect RNA priming but not DNA 

priming (Powell et al., 1997, J. Biol Chem. 272:13262). Because telomerase requires 
a DNA primer (e.g., the chromosome 3' end), it is not unexpected that telomerase 
should differ from other RTs in the primer grip region. In addition, the distance 
between motifs A and B' is longer in the TRTs than is typical for other RTs, which may 

30 represent an insertion within the "fingers" region of the structure which resembles a 
right hand (Figure 3; see Kohlstaedt et al., supra; Jacobo-Molina et al., supra; and 



23 



Patel et aL, supra). 

Moreover, as noted supra, Motif T is an additional hallmark of TRT 
proteins. This Motif T, as shown, for example in Figure 4 (W-L-X-Y-X-X-h-h-X-h-h- 
X-p-F-F-Y-X-T-E-X-p-X-X-X-p-X-X-X-Y-X-R-K-X-X-W [X is any amino acid, h is 
5 hydrophobic, p is polar]), comprises a sequence that can be described using the 
formula: 

Trp-R r X 7 -R r R r R 2 -X-Phe-Phe-Tyr-X-Thr-Glu 
-Xg.9-R3-R3-Arg-R4-X2.Trp 
where X is any amino acid and the subscript refers to the number of consecutive 
10 residues, Rj is leucine or isoleucine, R 2 is glutamine or arginine, R 3 is phenyalanine or 
tyrosine, and R4 is lysine or histidine. 

The T motif can also be described using the formula: 

Trp-R r X 4 -h-h-X-h-h-R 2 -p-Phe-Phe-Tyr-X-Thr-Glu- 
X-p-X 3 -p-X 2 _ 3 - R 3 -R 3 -Arg-R4-X 2 -Trp 
15 where X is any amino acid and a subscript refers to the number of consecutive residues, 
Ri is leucine or isoleucine, R 2 is glutamine or arginine, R 3 is phenyalanine or tyrosine, 
R« is lysine or histidine, h is a hydrophobic amino acid selected from Ala, Leu, He, Val, 
Pro, Phe, Trp, and Met, and p is a polar amino acid selected from Gly, Ser, Thr, Tyr, 
Cys, Asn and Gin. 

20 In one embodiment, the present invention provides isolated naturally 

occurring and recombinant TRT proteins comprising one or more of the motifs 
illustrated in Figure 11, e.g., 

Motif T W-X 12 -FFY-X-TE-X 10 . U -R-X 3 -WOC 7 -I 
Motif T E-X 2 -V-X 
25 Motif 1 X 3 -R-X 2 -P-K-X 3 , or, alternatively, h-R-h-X-P-K 

Motif 2 X-R-X-I-X or, alternatively, (F/L)-R-h-I-X 2 -h 
Motif A X 4 -F-X 3 -D-X 4 -YD-X 2 or, alternatively, P-X-L-Y-F-h-X-h-D-h- 
X 2 -C-Y-D-X-I 

Motif B' Y-X 4 -G-X 2 -QG-X 3 -S-X 8 or, alternatively, K-X-Y-X-Q-X 2 -G-I- 
30 P-Q-G-S-X-L-S-X-h-L 

Motif C X6-DD-X-L-X 3 or, alternatively, L-L-R-L-X-D-D-X-L-h-I-T 



When the TRT protein shown contains more than one TRT motif, the order (NH2 - 
>COOH) is as shown in Figure 1 1 . 

In one embodiment, the present invention provides isolated naturally 
occurring TRT proteins comprising the following supermotif: 

PK-X 4 _ 10 -R-X4-X^ 
Xfi-DD-X-^-X^M-Xn-K 

It will be apparent to one of skill that, provided with the reagents, 
including the TRT sequences disclosed herein for those reagents and the methods and 
guidance provided herein (including specific methodologies described infra), TRT 
genes and proteins can be obtained, isolated and produced in recombinant form by one 
of ordinary skill. For example, primers (e.g., degenerate amplification primers) are 
provided that hybridize to gene sequences encoding RT and T motifs characteristic of 
TRT. For example, one or more primers or degenerate primers that hybridize to 
sequences encoding the FFYXTE region of the T motif, other TRT motifs (as discussed 
infra), or combinations of motifs or consensus sequences, can be prepared based on the 
codon usage of the target organism, and used to amplify the TRT gene sequence from 
genomic DNA or cDNA prepared from the target organism. Use of degenerate primers 
is well known in the art and entails use of sets of primers that hybridize to the set of 
nucleic acid sequences that can potentially encode the amino acids of the target motif, 
taking into account codon preferences and usage of the target organism, and by using 
amplification (e.g., PCR) conditions appropriate for allowing base mismatches in the 
annealing steps of PCR. Typically two primer sets are used; however, single primer 
(or, in this case, a single degenerate primer set) amplification systems are well known 
and may be used to obtain TRT genes. 

Table 1 provides illustrative primers of the invention that may be used to 
amplify novel TRT nucleic acids, particularly those from vertebrates (e.g., humans and 
other mammals). "N" is an equimolar mixture of all four nucleotides, and nucleotides 

25 



within parentheses are equimolar mixtures of the specified nucleotides. 



TABLE 1 

ILLUSTRATIVE DEGENERATE PRIMERS FOR AMPLIFICATION 
OF TRT NTTCT.FJC ACIDS 



motif 



direction 5'- sequence -3' 
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a FFYVTE Forward TT(CT)TT(CT)TA(CT)GTNACNGA 
b FFYVTE Reverse TCNGTNAC(GA)TA(GA)AA(GA)AA 



c RFIPKP Forward (CA)GNTT(CT)AT(ACT)CCNAA(AG)CC 

d RFIPKP Reverse GG(TC)TTNGG(TGA)AT(GA)AANC 

15 e AYDTI Forward GCNTA(CT)GA(CT)ACNAT 

f AXPTI Reverse TANGT(GA)TC(GA)TANGC 
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g GIPOG Forward GGNAT(ACT)CCNCA(AG)GG 

h GJPQG.S Reverse (GC)(AT)NCC(TC)TGNGG(TGA)ATNCC 



i LVDDJL Forward 
j DDFLLVT Reverse 



(CT)TNGTNGA(CT)GA(CT)TT(CT)(CT)T 
GTNACNA(GA)NA(GA)(GA)AA(GA)TC(GA)TC 



Preferred primer combinations (y = yes, n = no) 
25 Reverse 

h d f h i 
n y y y y 
n n y y y 
n n n y y 
30 g- nnnny 



Forward 
a - 
c - 
e - 

g" 
i - 



n n n n n 



In one embodiment, an amplified TRT nucleic acid is used as a hybridization probe for 
35 colony hybridization to a library (e.g., cDNA library) made from the target organism, 
such that a nucleic acid having the entire TRT protein coding sequence, or a substantial 
portion thereof, is identified and isolated or cloned. Reagents and methods such as 



26 



those just described were used in accordance with the methods described herein to 
obtain TRT gene sequences of Oxytricha trifallax and Tetrahymena thermophila, as 
described in detail infra. It will be recognized that following cloning of a previously 
uncharacterized TRT gene, the sequence can be determined by routine methods and the 
encoded polypeptide synthesized and assayed for a TRT activity, such as telomerase 
catalytic activity (as described herein and/or by telomerase assays known in the art). 

It will also be apparent to those of skill that TRT genes may be cloned 
using any of a variety of cloning methods of the invention because the TRT motif 
sequences and the nucleic acids of the invention comprising such sequences can be 
used in a wide variety of such methods. For example, hybridization using a probe 
based on the sequence of a known TRT to DNA or other nucleic acid libraries from the 
target organism, as described in Example 1 can be used. It will be appreciated that 
degenerate PCR primers or their amplification products such as those described supra, 
may themselves be labeled and used as hybridization probes. In another embodiment, 
expression cloning methods are used. For example, one or more antibodies that 
specifically bind peptides that span a TRT motif or other TRT epitope, such as the 
FFYXTE motif can be employed to isolate a ribosomal complex comprising a TRT 
protein and the mRNA that encodes it. For generating such antibodies of the invention, 
the peptide immunogens are typically between 6 and 30 amino acids in length, more 
often about 10 to 20 amino acids in length. The antibodies may also be used to probe a 
cDNA expression library derived from the organism of interest to identify a clone 
encoding a TRT sequence. In another embodiment, computer searches of DNA 
databases for DNAs containing sequences conserved with known TRTs can also be 
used to identify a clone comprising TRT sequence. 

In one aspect, the present invention provides compositions comprising 
an isolated or recombinant polypeptide having the amino acid sequence of a naturally 
occurring TRT protein. Usually the naturally occurring TRT has a molecular weight of 
between about 80,000 daltons (D) and about 150,000 D, most often between about 
95,000 D and about 130,000 D. Typically, the naturally occurring TRT has a net 
positive charge at pH 7 (calculated pi typically greater than 9). In one embodiment, the 
polypeptide exhibits a telomerase activity as defined herein. In a related embodiment, 
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the polypeptide has a TRT-specific region (T motif) sequence and exhibits a telomerase 
activity. The invention further provides fragments of such polypeptides. The present 
invention also provides isolated or recombinant polynucleotide having the sequence of 
a naturally occurring gene encoding a TRT protein. The invention provides regents 
useful for isolating sequence of a TRT from nonvertebrate (such as a yeast) and 
vertebrates, such as mammals (e.g., murine or human). The isolated polynucleotide 
may be associated with other naturally occurring or recombinant or synthetic vector 
nucleic acid sequences. Typically, the isolated nucleic acid is smaller than about 300 
kb, often less than about 50 kb, more often less than about 20 kb, frequently less than 
about 10 kb and sometimes less than about 5 kb or 2 kb in length. In some 
embodiments the isolated TRT polynucleotide is even smaller, such as a gene fragment, 
primer, or probe of less than about 1 kb or less than 0.1 kb. 

III. NUCLEIC ACIDS 

A) GENERALLY 

The present invention provides isolated and recombinant nucleic acids 
having a sequence of a polynucleotide encoding a telomerase catalytic subunit protein 
(TRT), such as a recombinant TRT gene from Euplotes, Tetrahymena, S. pombe or 
humans. Exemplary polynucleotides are provided in Figure 13 (Euplotes); Figure 15 
(S. pombe) and Figure 16 (human, GenBank Accession No. AF015950). The present 
invention provides sense and anti-sense polynucleotides having a TRT gene sequence, 
including probes, primers, TRT-protein-encoding polynucleotides, and the like. 

B) HUMAN TRT 

The present invention provides nucleic acids having a sequence of a 
telomerase catalytic subunit from humans (i.e., hTRT). 

In one aspect, the invention provides a polynucleotide having a sequence 
or subsequence of a human TRT gene or RNA. In one embodiment, the polynucleotide 
of the invention has a sequence of SEQUENCE ID NO: 1 shown in Figure 16 or a 
subsequence thereof. In another embodiment, the polynucleotide has a sequence of 
SEQUENCE ID NO: 3 (Figure 18), SEQUENCE ID NO: 4 (Figure 20), or 

28 



# • 

subsequences thereof. The invention also provides polynucleotides with substantial 
sequence identity to the hTRT nucleic acid sequences disclosed herein, e.g., including 
but not limited to SEQUENCE ID NOS: 1 [Figure 16], 4 [Figure 20], 6 [Figure 21], 
and 7 [Figure 12]). Thus, the invention provides naturally occurring alleles of human 
TRT genes and variant polynucleotide sequences having one or more nucleotide 
deletions, insertions or substitutions relative to an hTRT nucleic acid sequence 
disclosed herein. As described infra, variant nucleic acids may be produced using the 
recombinant or synthetic methods described below or by other means. 

The invention also provides isolated and recombinant polynucleotides 
having a sequence from a flanking region of a human TRT gene. Such polynucleotides 
include those derived from genomic sequences of untranslated regions of the hTRT 
mRNA. An exemplary genomic sequence is shown in Figure 21 (SEQUENCE ID NO: 
6). As described in Example 4, SEQUENCE ID NO: 6 was obtained by sequencing a 
clone, isolated from a human genomic library. Lambda G05 contains a 15 

kilobasepair (kbp) insert including approximately 13,000 bases 5' to the hTRT coding 
sequences. This clone contains hTRT promoter sequences and other hTRT gene 
regulatory sequences (e.g., enhancers). 

The invention also provides isolated and recombinant polynucleotides 
having a sequence from an intronic region of a human TRT gene. An exemplary 
intronic sequence is shown in Figure 12 (SEQUENCE ID NO: 7; see Example 3). In 
some embodiments, hTRT introns are included in "minigenes" for improved expression 
of hTRT proteins in eukaryotic cells. 

In a related aspect, the present invention provides polynucleotides that 
encode hTRT proteins or protein fragments, including modified, altered and variant 
hTRT polypeptides. In one embodiment, the encoded hTRT protein or fragment has an 
amino acid sequence as set forth in Figure 17 (SEQUENCE ID NO: 2), or with 
conservative substitutions of SEQUENCE ID NO: 2. In one embodiment, the encoded 
hTRT protein or fragment has substitutions that change an activity of the protein (e.g., 
telomerase catalytic activity). 

It will be appreciated that, as a result of the degeneracy of the genetic 
code, the nucleic acid encoding the hTRT protein need not have the sequence of a 
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naturally occurring hTRT gene, but that a multitude of polynucleotides can encode an 
hTRT polypeptide having an amino acid sequence of SEQUENCE ID NO: 2. The 
present invention provides each and every possible variation of nucleotide sequence 
that could be made by selecting combinations based on possible codon choices made in 
5 accordance with known triplet genetic codes, and all such variations are specifically 

disclosed hereby. Thus, although in some cases hTRT polypeptide-encoding nucleotide 
sequences that are capable of hybridizing to the nucleotide sequence of the naturally 
occurring sequence (under appropriately selected conditions of stringency) are 
preferred, it may be advantageous in other cases to produce nucleotide sequences 

10 encoding hTRT that employ a substantially different codon usage and so perhaps do not 
hybridize to nucleic acids with the naturally occurring sequence. 

In particular embodiments, the invention provides hTRT oligo- and 
polynucleotides that comprise a subsequence of an hTRT nucleic acid disclosed herein 
(e.g., SEQUENCE ID NOS: 1 and 6). The nucleic acids of the invention typically 

15 comprise at least about 10, more often at least about 12 or about 15 consecutive bases 
of the exemplified hTRT polynucleotide. Often, the nucleic acid of the invention will 
comprise a longer sequence, such as at least about 25, about 50, about 100, about 200, 
or at least about 500 to 3000 bases in length, for example when expression of a 
polypeptide, or full length hTRT protein is intended. 

20 In still other embodiments, the present invention provides "Al 82 hTRT" 

polynucleotides having a sequence identical or complementary to naturally occurring or 
non-naturally occurring hTRT polynucleotides such as SEQUENCE ID NO: 3 or 
SEQUENCE ID NO; 4, which do not contain the 182 nucleotide sequence 
(SEQUENCE ID NO: 9 [Figure 24]) found in pGRN121 (and also absent in clone 

25 712562). These polynucleotides are of interest, in part, because they encode 

polypeptides that contain different combinations or arrangements of TRT motifs than 
found in the "full-length" hTRT polypeptide (SEQUENCE ID NO: 2) such as is 
encoded by pGRN121. As discussed infra, it is contemplated that these polypeptides 
may play a biological role in nature (e.g., in regulation of telomerase expression in 

30 cells) and/or find use as therapeutics (e.g., as dominant-negative products that inhibit 
function of wild-type proteins), or have other roles and uses, e.g. as described herein. 
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For example, in contrast to the polypeptide encoded by pGRN121, clone 
712562 encodes a 259 residue protein with a calculated molecular weight of 
approximately 30 kD (hereinafter, "712562 hTRT"). The 712562 hTRT polypeptide 
(SEQUENCE ID NO: 10 [Figure 19]) contains motifs T, 1, 2, and A, but not motifs B', 

5 C, D and E (See Figure 4). Similarly, a variant hTRT polypeptide with therapeutic and 
other activities may be expressed from a nucleic acid similar to the pGRN121 cDNA 
but lacking the 182 basepairs missing in clone 712562, e.g., having the sequence shown 
in Figure 20 (SEQUENCE ID NO: 4). This nucleic acid (hereinafter, "pro90 hTRT")? 
which may be synthesized using routine synthetic or recombinant methods as described 

1 0 herein, encodes a protein of 807 residues (calculated molecular weight of 

approximately 90 kD) that shares the same amino terminal sequence as the hTRT 
protein encoded by SEQUENCE ID NO: 1, but diverges at the carboxy-terminal region 
(the first 763 residues are common, the last 44 residues of pro90 hTRT are different 
than "full-length" hTRT). The pro90 hTRT polypeptide contains motifs T, 1, 2, and A, 

1 5 but not motifs B, C, D, E, and thus may have some, but not likely all telomerase 
activities. 

C) PRODUCTION OF HUMAN TRT NUCLEIC ACIDS 

The polynucleotides of the invention have numerous uses including, but 
20 not limited to, expression of polypeptides encoding hTRT or fragments thereof, use as 
sense or antisense probes or primers for hybridization and/or amplification of naturally 
occurring hTRT genes or RNAs (e.g. for diagnostic or prognostic applications), and as 
therapeutic agents (e.g., in antisense, triplex, or ribozyme compositions). As will be 
apparent upon review of the disclosure, these uses will have enormous impact on the 
25 diagnosis and treatment of human diseases relating to aging, cancer, and fertility as well 
as the growth, reproduction, and manufacture of cell-based products. As described in 
the following sections, the hTRT nucleic acids of the invention may be made (e.g., 
cloned, synthesized, or amplified) using techniques well known in the art. 
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1) CLONING, AMPLIFICATION, AND RECOMBINANT 
PRODUCTION 

In one embodiment, hTRT genes or cDNAs are cloned using a nucleic 
acid probe that specifically hybridizes to an hTRT mRNA, cDNA, or genomic DNA. 
5 One suitable probe for this purpose is a polynucleotide having all or part of the 

sequence provided in Figure 16 (SEQUENCE ID NO: 1), such as a probe comprising a 
subsequence thereof. Typically, the target hTRT genomic DNA or cDNA is Iigated 
into a vector (e.g., a plasmid, phage, virus, yeast artificial chromosome, or the like) and 
may be isolated from a genomic or cDNA library (e.g., a human placental cDNA 

10 library). Once an hTRT nucleic acid is identified, it can be isolated according to 
standard methods known to those of skill in the art. An illustrative example of 
screening a human cDNA library for the hTRT gene is provided in Example 1 ; 
similarly, an example of screening a human genomic library is found in Examples 3 and 
4. Cloning methods are well known and are described, for example, in Sambrook et aL, 

1 5 (1989) Molecular Cloning: A Laboratory Manual, 2nd Ed., Vols. 1-3, Cold 
Spring Harbor Laboratory hereinafter, "Sambrook"); Berger and Kimmel, (1987) 
Methods in Enzymology, Vol. 152: Guide to Molecular Cloning Techniques, 
San Diego: Academic Press, Inc.; Ausubel et al., Current Protocols in Molecular 
Biology, Greene Publishing and Wiley-Interscience, New York (1997); Cashion et aL, 

20 U.S. Patent No. 5,017,478; and Carr, European Patent No. 0,246,864. 

The invention also provides hTRT genomic or cDNA nucleic acids 
isolated by amplification methods such as the polymerase chain reaction (PCR). In one 
embodiment, hTRT protein coding sequence is amplified from an RNA or cDNA 
sample (e.g., double stranded placental cDNA (Clontech, Palo Alto CA)) using the 

25 primers 5 f -GTGAAGGCACTGTTCAGCG-3 , ("TCP 1.1") and 

5-CGCGTGGGTGAGGTGAGGTG-3 ("TCP 1.15"). In some embodiments a third 
primer or second pair of primers may be used, e.g., for "nested PCR", to increase 
specificity. One example of a second pair of primers is 5'- 
CTGTGCTGGGCCTGGACGATA-3' ("billTCP6") and 5'- 

30 AGCTTGTTCTCCATGTCGCCGTAG-3' ("TCP1.14"). It will be apparent to those of 
skill that numerous other primers and primer combinations, useful for amplification of 
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hTRT nucleic acids are provided by the present invention. 

Moreover, the invention provides primers that amplify any specific 
region (e.g., coding regions, promoter regions, and/or introns) or subsequence of hTRT 
genomic DNA, cDNA or RNA. For example, the hTRT intron at position 274/275 of 
5 SEQUENCE ID NO: 1 (see Example 3) may be amplified (e.g., for detection of 
genomic clones) using primers TCP 1 .57 and TCP 1 .52 (primer pair 1) or primers 
TCP 1 .49 and TCP 1 .50 (primer pair 2). (Primer names refer to primers listed in Table 
2, infra.) The primer pairs can be used individually or in a nested PCR where primer 
set 1 is used first. Another illustrative example relates to primers that specifically 

10 amplify and so detect the 5' end of the hTRT mRNA or the exon encoding the 5 5 end of 
hTRT gene (e.g., to assess the size or completeness of a cDNA clone). The following 
primer pairs are useful for amplifying the 5' end of hTRT: primers K320 and K321 
(primer pair 3); primers K320 and TCP1.61 (primer pair 4); primers K320 and K322 
(primer pair 5). The primer sets can be used in a nested PCR in the order set 5, then set 

15 4 or set 3, or set 4 or set 5, then set 3. Yet another illustrative example involves primers 
chosen to amplify or detect specifically the conserved hTRT TRT motif region 
comprising approximately the middle third of the mRNA (e.g., for use as a 
hybridization probe to identify TRT clones from, for example, nonhuman organisms). 
The following primer pairs are useful for amplifying the TRT motif region of hTRT 

20 nucleic acids: primers K304 and TCP 1 .8 (primer pair 6), or primers LT1 and TCP 1.15 
(primer pair 7). The primer sets can be used in a nested PCR experiment in the order 
set 6 then set 7. 

Suitable PCR amplification conditions are known to those of skill and 
include (but are not limited to) 1 unit Taq polymerase (Perkin Elmer, Norwalk CT), 100 

25 \iM each dNTP (dATP, dCTP, dGTP, dTTP), lx PCR buffer (50 mM KC1, 10 mM 

Tris, pH 8.3 at room temperature, 1.5 mM MgCl 2 , 0.01% gelatin) and 0.5 \iM primers, 
with the amplification run for about 30 cycles at 94° for 45 sec, 55 ° for 45 sec and 72° 
for 90 sec. It will be recognized by those of skill in the art that other thermostable 
DNA polymerases, reaction conditions, and cycling parameters will also provide 

30 suitable amplification. Other suitable in vitro amplification methods that can be used to 
obtain hTRT nucleic acids include, but are not limited to, those herein, infra. Once 



33 




amplified, the hTRT nucleic acids can be cloned, if desired, into any of a variety of 
vectors using routine molecular biological methods or detected or otherwise utilized in 
accordance with the methods of the invention. 

One of skill will appreciate that the cloned or amplified hTRT nucleic 
5 acids obtained as described above can be prepared or propagated using other methods, 
such as chemical synthesis or replication by transformation into bacterial systems, such 
as E. coli (see, e.g., Ausubel et aL, supra), or eukaryotic, such as mammalian, 
expression systems. Similarly, hTRT RNA can be expressed in accordance with the 
present in vitro methods, or in bacterial systems such as E. coli using, for example, 

1 0 commercially available vectors containing promoters recognized by an RNA 
polymerase such as T7, T3 or SP6, or transcription of DNA generated by PCR 
amplification using primers containing an RNA polymerase promoter. 

The present invention further provides altered or modified hTRT nucleic 
acids. It will be recognized by one of skill that the cloned or amplified hTRT nucleic 

1 5 acids obtained can be modified (e.g., truncated, derivatized, altered) by methods well 
known in the art (e.g., site-directed mutagenesis, linker scanning mutagenesis) or 
simply synthesized de novo as described below. The altered or modified hTRT nucleic 
acids are useful for a variety of applications, including, but not limited to, facilitating 
cloning or manipulation of an hTRT gene or gene product, or expressing a variant 

20 hTRT gene product. For example, in one embodiment, the hTRT gene sequence is 

altered such that it encodes an hTRT polypeptide with altered properties or activities, as 
discussed in detail in infra, for example, by mutation in a conserved motif of hTRT. In 
another illustrative example, the mutations in the protein coding region of an hTRT 
nucleic acid may be introduced to alter glycosylation patterns, to change codon 

25 preference, to produce splice variants, remove protease-sensitive sites, create antigenic 
domains, modify specific activity, and the like. In other embodiments, the nucleotide 
sequence encoding hTRT and its derivatives is changed without altering the encoded 
amino acid sequences, for example, the production of RNA transcripts having more 
desirable properties, such as increased translation efficiency or a greater or a shorter 

30 half-life, compared to transcripts produced from the naturally occurring sequence. In 
yet another embodiment, altered codons are selected to increase the rate at which 
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expression of the peptide occurs in a particular prokaryotic or eukaryotic expression 
host in accordance with the frequency with which particular codons are utilized by the 
host. Useful in vitro and in vivo recombinant techniques that can be used to prepare 
variant hTRT polynucleotides of the invention are found in Sambrook et ah and 
5 Ausubel et aL, both supra. 

As noted supra, the present invention provides nucleic acids having 
flanking (5 f or 3 1 ) and intronic sequences of the hTRT gene. The nucleic acids are of 
interest, inter alia, because they contain promoter and other regulatory elements 
involved in hTRT regulation and useful for expression of hTRT and other recombinant 

1 0 proteins or RN A gene products. It will be apparent that, in addition to the nucleic acid 
sequences provided in SEQUENCE ID NOS: 6 and 7, additional hTRT intron and 
flanking sequences may be readily obtained using routine molecular biological 
techniques. For example, additional hTRT genomic sequence may be obtained from 
Lambda clone G<&5 (ATCC Accession No. 209024), described supra and in Example 4. 

15 Still other hTRT genomic clones and sequences may be obtained by screening a human 
genomic library using an hTRT nucleic acid probe having a sequence or subsequence 
from SEQUENCE ID NO: 1. Additional clones and sequences (e.g., still further 
upstream) may be obtained by using labeled sequences or subclones derived from 
AG<£5 to probe appropriate libraries. Other useful methods for further characterization 

20 of hTRT flanking sequences include those general methods described by Gobinda et aL, 
1 993, PCR Metk Applic. 2:318; Triglia et aL, 1 988, Nucleic Acids Res. 16:8186; 
Lagerstrom et aL, 1 99 1 , PCR Methods Applic. 1 : 1 1 1 ; and Parker et aL, 1 99 1 , Nucleic 
Acids Res. 19:3055. 

Intronic sequences can be identified by routine means such as by 

25 comparing the hTRT genomic sequence with hTRT cDNA sequences (see, e.g, 5 

Example 3), by SI analysis (see Ausubel et aL, supra, at Chapter 4), or various other 
means known in the art. Intronic sequences can also be found in pre-mRNA (i.e., 
unspliced or incompletely spliced mRNA precursors), which may be amplified or 
cloned following reverse transcription of cellular RNA. 

30 When desired, the sequence of the cloned, amplified, or otherwise 

synthesized hTRT or other TRT nucleic acid can be determined or verified using DNA 
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sequencing methods well known in the art (see, e.g., Ausubel et aL, supra). Useful 
methods of sequencing employ such enzymes as the Klenow fragment of DNA 
polymerase I, Sequenase (US Biochemical Corp, Cleveland OH), Tag DNA polymerase 
(Perkin Elmer, Norwalk CT), thermostable T7 polymerase (Amersham, Chicago IL), or 
5 combinations of recombinant polymerases and proofreading exonucleases such as the 
ELONGASE Amplification System marketed by Gibco BRL (Gaithersburg MD). 
When sequencing or verifying the sequence of oligonucleotides (such as 
oligonucleotide made de novo by chemical synthesis), the method of Maxam and 
Gilbert may be preferred (Maxam and Gilbert, 1980, Meth Enz. 65:499; Ausubel et aL, 
10 supra, Ch. 7). 

The 5 ! untranslated sequences of hTRT or other TRT mRNAs can be 
determined directly by cloning a "full-length" hTRT or other cDNA using standard 
methods such as reverse transcription of mRNA, followed by cloning and sequencing 
the resulting cDNA. Preferred oligo(dT)-primed libraries for screening or amplifying 

1 5 full length cDNAs that have been size-selected to include larger cDNAs may be 
preferred. Random primed libraries are also suitable and often include a larger 
proportion of clones that contain the 5 1 regions of genes. Other well known methods 
for obtaining 5' RNA sequences, such as the RACE protocol described by Frohman et 
aL, 1988, Proa Nat Acad. Sci USA 85:8998, may also be used. If desired, the 

20 transcription start site of an hTRT or other TRT mRNA can be determined by routine 
methods using the nucleic acids provided herein (e.g., having a sequence of 
SEQUENCE ID NO: 1). One method is SI nuclease analysis (Ausubel et aL, supra) 
using a labeled DNA having a sequence from the 5' region of SEQUENCE ID NO: 1 . 

25 2) CHEMICAL SYNTHESIS OF NUCLEIC ACIDS 

The present invention also provides hTRT polynucleotides (RNA, DNA 
or modified) that are produced by direct chemical synthesis. Chemical synthesis is 
generally preferred for the production of oligonucleotides or for oligonucleotides and 
polynucleotides containing nonstandard nucleotides (e.g., probes, primers and antisense 
30 oligonucleotides). Direct chemical synthesis of nucleic acids can be accomplished by 
methods known in the art, such as the phosphotriester method of Narang et aL, 1 979, 
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Meth. Enzymol. 68:90; the phosphodiester method of Brown et al., Meth. Enzymol. 
68:109 (1979); the diethylphosphoramidite method of Beaucage et al., Tetra. Lett., 
22:1859 (1981); and the solid support method of U.S. Patent No. 4,458,066. Chemical 
synthesis typically produces a single stranded oligonucleotide, which may be converted 
5 into double stranded DNA by hybridization with a complementary sequence, or by 

polymerization with a DNA polymerase and an oligonucleotide primer using the single 
strand as a template. One of skill will recognize that while chemical synthesis of DNA 
is often limited to sequences of about 100 or 150 bases, longer sequences may be 
obtained by the ligation of shorter sequences or by more elaborate synthetic methods. 
1 o It will be appreciated that the hTRT (or hTR or other) polynucleotides 

and oligonucleotides of the invention can be made using nonstandard bases (e.g., other 
than adenine, cytidine, guanine, thymine, and uridine) or nonstandard backbone 
structures to provides desirable properties (e.g., increased nuclease-resistance, 
tighter-binding, stability or a desired TJ. Techniques for rendering oligonucleotides 
1 5 nuclease-resistant include those described in PCT publication WO 94/1 263 3 . A wide 
variety of useful modified oligonucleotides may be produced, including 
oligonucleotides having a peptide-nucleic acid (PNA) backbone (Nielsen et al., 1991, 
Science 254:1497) or incorporating 2-O-methyl ribonucleotides, phosphorothioate 
nucleotides, methyl phosphonate nucleotides, phosphotriester nucleotides, 
20 phosphorothioate nucleotides, phosphoramidates. Still other useful oligonucleotides 
may contain alkyl and halogen-substituted sugar moieties comprising one of the 
following at the 2' position: OH, SH, SCH 3 , F, OCN, OCH 3 OCH 3 , OCH 3 0(CH 2 ) n CH 3 , 
0(CH 2 ) n NH 2 or 0(CH 2 ) n CH 3 where n is from 1 to about 10; C, to C 10 lower alkyl, 
substituted lower alkyl, alkaryl or aralkyl; CI; Br; CN; CF 3 ; OCF 3 ; 0-, S-, or N-alkyl; 
25 0-, S-, or N-alkenyl; SOCH 3 ; S0 2 CH 3 ; ON0 2 ; N0 2 ; N 3 ; NH 2 ; heterocycloalkyl; 
heterocycloalkaryl; arninoalkylamino; polyalkylamino; substituted silyl; an RNA 
cleaving group; a cholesteryl group; a folate group; a reporter group; an intercalator; a 
group for improving the pharmacokinetic properties of an oligonucleotide; or a group 
for improving the pharmacodynamic properties of an oligonucleotide and other 
30 substituents having similar properties. Folate, cholesterol or other groups which 

facilitate oligonucleotide uptake, such as lipid analogs, may be conjugated directly or 
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via a linker at the 2' position of any nucleoside or at the 3 f or 5 1 position of the 3 1 - 
terminal or 5-terminal nucleoside, respectively. One or more such conjugates may be 
used. Oligonucleotides may also have sugar mimetics such as cyclobutyls in place of 
the pentofuranosyl group. Other embodiments may include at least one modified base 
5 form or "universal base 11 such as inosine, or inclusion of other nonstandard bases such 
as queosine and wybutosine as well as acetyl-, methyl-, thio- and similarly modified 
forms of adenine, cytidine, guanine, thymine, and uridine which are not as easily 
recognized by endogenous endonucleases. The invention further provides 
oligonucleotides having backbone analogues such as phosphodiester, phosphorothioate, 

10 phosphorodithioate, methylphosphonate, phosphoramidate, alkyl phosphotriester, 
sulfamate, 3 f -thioacetal, methylene(methylimino), 3'-N-carbamate, morpholino 
carbamate, chiral-methyl phosphonates, nucleotides with short chain alkyl or cycloalkyl 
intersugar linkages, short chain heteroatomic or heterocyclic intersugar ("backbone") 
linkages, or CH 2 -NH-0-CH 2 , CH 2 -N(CH 3 )-OCH 2 , CH 2 -0-N(CH 3 )-CH 2 , 

1 5 CH 2 -N(CH 3 )-N(CH 3 )-CH 2 and 0-N(CH 3 )-CH 2 -CH 2 backbones (where phosphodiester 
is 0-P-0-CH 2 ), or mixtures of the same. Also useful are oligonucleotides having 
morpholino backbone structures (U.S. Patent No. 5,034,506). 

Useful references include Oligonucleotides and Analogues, A Practical 
Approach, edited by F. Eckstein, IRL Press at Oxford University Press (1991); 

20 Antisense Strategies, Annals of the New York Academy of Sciences, Volume 600, Eds. 
Baserga and Denhardt (NYAS 1992); Milligan et al., 9 July 1993, J. Med. Chem. 
36(14): 1923-1937; Antisense Research and Applications (1993, CRC Press), in its 
entirety and specifically Chapter 15, by Sanghvi, entitled "Heterocyclic base 
modifications in nucleic acids and their applications in antisense oligonucleotides." 

25 Antisense Therapeutics, ed. Sudhir Agrawal (Humana Press, Totowa, New Jersey, 
1996). 
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D) LABELING NUCLEIC ACIDS 

It is often useful to label the nucleic acids of the invention, for example, 
when the hTRT or other oligonucleotides or polynucleotides are to be used as nucleic 
acid probes. The labels (see infra) may be incorporated by any of a number of means 
5 well known to those of skill in the art. In one embodiment, an unamplified nucleic 
acid (e.g., mRNA, polyA mRNA, cDNA) is labeled. Means of producing labeled 
nucleic acids are well known to those of skill in the art and include, for example, nick- 
translation, random primer labeling, end-labeling (e.g. using a kinase), and chemical 
conjugation (e.g., photobiotinylation) or synthesis. In another embodiment, the label is 

1 0 simultaneously incorporated during an amplification step in the preparation of the 
sample nucleic acids. Thus, for example, polymerase chain reaction (PCR) or other 
nucleic acid amplification method with labeled primers or labeled nucleotides will 
provide a labeled amplification product. In another embodiment, transcription 
amplification using a labeled nucleotide (e.g. fluorescein-labeled UTP and/or CTP) 

1 5 incorporates a label into the transcribed nucleic acids. An amplification product may 
also, or alternatively, be labeled after the amplification is completed. 

E) ILLUSTRATIVE OLIGONUCLEOTIDES 

As noted supra and discussed in detail infra, oligonucleotides are used 
20 for a variety of uses including as primers, probes, therapeutic or other antisense 

oligonucleotides, triplex oligonucleotides, and numerous other uses as apparent from 
this disclosure. Table 2 provides certain illustrative specific oligonucleotides that may 
be used in the practice of the invention. It will be appreciated that numerous other 
useful oligonucleotides of the invention may be synthesized by one of skill, following 
25 the guidance provided herein. 

In Table 2, "seq" means that the primer has been used, or is useful, for 
sequencing; "PCR" means that the primer has been used, or is useful, for PCR; "AS" 
means that means that the primer has been used, or is useful for antisense inhibition of 
telomerase activity; "CL" means that the primer has been used, or is useful in cloning 
30 regions of hTRT genes or RNA, "mut" means that the primer has been used, or is 

useful for constructing mutants of hTRT genes or gene products. "UC" means "upper 
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case," and "Ic" means "lower case." Mismatches and insertions (relative to 
SEQUENCE ID NO: 1) are indicated by underlining; deletions are indicated by a 
It will be appreciated that nothing in Table 2 is intended to limit the use of any 
particular oligonucleotide to any single use or set of uses. 
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IV. TRT PROTEINS AND PEPTIDES 
A) GENERALLY 

The invention provides a wide variety of hTRT proteins useful for, inter 
alia, production of telomerase activity, inhibition of telomerase activity in a cell, 
5 induction of an anti-hTRT immune response, as a therapeutic reagent, as a standard or 
control in a diagnostic assay, as a target in a screen for compounds capable of activation 
or inhibition of an activity of hTRT or telomerase, and numerous other uses that will be 
apparent to one of skill or are otherwise described herein. The hTRT of the invention 
include functionally active proteins (useful for e.g., conferring telomerase activity in a 

10 telomerase-negative cell) and variants, inactive variants (useful for e.g., inhibiting 
telomerase activity in a cell), hTRT polypeptides, and telomerase RNPs (e.g., 
ribonucleoprotein complexes comprising the proteins) that exhibit one, several, or all of 
the functional activities of naturally occurring hTRT and telomerase, as discussed in 
greater detail for illustrative purposes, below. 

15 In one embodiment, the hTRT protein of the invention is a polypeptide 

having a sequence as set forth in Figure 17 (SEQUENCE ID NO: 2), or a fragment 
thereof. In another embodiment, the hTRT polypeptide differs from SEQUENCE ID 
NO: 2 by internal deletions, insertions, or conservative substitutions of amino acid 
residues. In a related embodiment, the invention provides hTRT polypeptides with 

20 substantial similarity to SEQUENCE ID NO: 2. The invention further provides hTRT 
polypeptides that are modified, relative to the amino acid sequence of SEQUENCE ID 
NO: 2, in some manner, e.g., truncated, mutated, derivatized, or fused to other 
sequences (e.g., to form a fusion protein). Moreover, the present invention provides 
telomerase RNPs comprising an hTRT protein of the invention complexed with a 

25 template RNA (e.g., hTR). In other embodiments, one or more telomerase-associated 
proteins is associated with hTRT protein and/or hTR. 

The invention also provides other naturally occurring hTRT species or 
nonnaturally occurring variants, such as proteins having the sequence of, or substantial 
similarity to SEQUENCE ID NO: 5 [ [Figure 20], SEQUENCE ID NO: 10 [Figure 19], 

30 and fragments, variants, or derivatives thereof. 

The invention provides still other hTRT species and variants. One 
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example of an hTRT variant may result from ribosome frameshifting of mRNA 
encoded by the clone 712562 (SEQUENCE ID NO: 3 [Figure 18]) or the pro90 variant 
hTRT shown in SEQUENCE ID NO: 4 [Figure 20] and so result in the synthesis of 
hTRT polypeptides containing all the TRT motifs (for a general example, see, e.g., 
5 Tsuchihashi et al., 1990, Proc. Natl Acad ScL USA 87:2516; Craigengen et al., 1987, 
Cell 50:1; Weiss, 1990, Cell 62:117). Ribosome frameshifting can occur when specific 
mRNA sequences or secondary structures cause the ribosome to "stall" and jump one 
nucleotide forwards or back in the sequence. Thus, a ribosome frameshift event on the 
712562 mRNA could cause the synthesis of an approximately 523 amino acid residue 

1 0 polypeptide. A ribosome frameshift event on the pro90 sequence could result in a 

protein with approximately 1071 residues. It will be appreciated that proteins resulting 
from ribosome frameshifting can also be expressed by synthetic or recombinant 
techniques provided by the invention. 

Human TRT proteins, peptides, and functionally equivalent proteins 

15 may be obtained by purification, chemical synthesis, or recombinant production, as 
discussed in greater detail below. 

B) TRT PROTEIN ACTIVITIES 

The TRT polypeptides of the invention (including fragments, variants, 
20 products of alternative alleles, and fusion proteins) can have one or more, or all of the 
functional activities associated with native hTRT. Except as noted, as used herein, an 
hTRT or other TRT polypeptide is considered to have a specified activity if the activity 
is exhibited by either the hTRT protein without an associated RNA (e.g., hTR) or in an 
hTRT-associated RNA (e.g., hTR) complex. The hTR-binding activity of hTRT is one 
25 example of an activity associated with the hTRT protein. Methods for producing 

complexes of nucleic acids (e.g., hTR) and the hTRT polypeptides of the invention are 
described infra. 

Modification of the hTRT protein (e.g., by chemical or recombinant 
means, including mutation or modification of a polynucleotide encoding the hTRT 
30 polypeptide or chemical synthesis of a polynucleotide that has a sequence different than 
a native polynucleotide sequence) to have a different complement of activities than 
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native hTRT can be useful in therapeutic applications or in screening for specific 
modulators of hTRT or telomerase activity . In addition, assays for various hTRT 
activities can be particularly useful for identification of agents (e.g., activity modulating 
agents) that interact with hTRT or telomerase to change telomerase activity. 
5 The activities of native hTRT, as discussed infra, include telomerase 

catalytic activity (which may be either processive or non-processive activity); 
telomerase processivity; conventional reverse transcriptase activity; nucleolytic 
activity; primer or substrate (telomere or synthetic telomerase substrate or primer) 
binding activity; dNTP binding activity; RNA (i.e., hTR) binding activity; and protein 
10 binding activity (e.g., binding to telomerase-associated proteins, telomere-binding 

proteins, or to a protein-telomeric DNA complex). It will be understood, however, that 
present invention also provides hTRT compositions without any particular hTRT 
activity but with some useful activity related to the hTRT or other TRT proteins (e.g., 
certain typically short immunogenic peptides, inhibitory peptides). 

15 

1) TELOMERASE CATALYTIC ACTIVITY 

As used herein, a polypeptide of the invention has "telomerase catalytic 
activity," when the polypeptide is capable of extending a DNA primer that functions as 
a telomerase substrate by adding a partial, one, or more than one repeat of a sequence 

20 (e.g., TTAGGG) encoded by a template nucleic acid (e.g., hTR). This activity may be 
processive or nonprocessive. Processive activity occurs when a telomerase RNP adds 
multiple repeats to a primer or telomerase before the DNA is released by the enzyme 
complex. Non-processive activity occurs when telomerase adds a partial, or only one, 
repeat to a primer and is then released. In vivo, however, a non-processive reaction 

25 could add multiple repeats by successive rounds of association, extension, and 

dissociation. This can occur in vitro as well, but it is not typically observed in standard 
assays due to the vastly large molar excess of primer over telomerase in standard assay 
conditions. 

To characterize an hTRT polypeptide as having non-processive activity, a 
30 conventional telomerase reaction is performed using conditions that favor a 

non-processive reaction, for example high temperatures (i.e., 35-40°C, typically 37°C), 
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low dGTP concentrations (1 \xM or less), high primer concentrations (5 pM or higher), 
and high dATP/TTP concentrations (2 mM or higher), with the temperature and dGTP 
typically having the greatest effect. To characterize an hTRT polypeptide as having 
processive activity, a conventional telomerase reaction is performed using conditions 
5 that favor a processive reaction (for example, 27-34°C, typically 30°C), high dGTP 

concentration (10 \iM or higher), low primer concentration (1 \xM or lower), and/or low 
dATP and TTP concentrations (0.3-1 mM) with temperature and dGTP typically 
concentration being the most critical. Alternatively, a TRAP assay (for processive or 
moderately processive activity) or the dot-blot and gel blot assays (for processive 

1 0 activity) may be used. The hTRT polypeptide of the invention can possess a 

non-processive activity, but not a processive activity (e.g., if an alteration of the hTRT 
polypeptide reduces or eliminates the ability to translocate), can be solely processive, or 
can possess both activities. 

a) Non-processive Activity 

15 A non-processive telomerase catalytic activity can extend the DNA 

primer from the position where the 3 T end anneals to the RNA template to the 5 ! end of 
the template sequence, typically terminating with the addition of the first G residue (as, 
for example, when the template is hTR). As shown below, the exact number of 
nucleotides added is dependent on the position of the 3 f terminal nucleotide of the 

20 primer in the TTAGGG repeat sequence. 

NONPROCESSIVE ACTIVITY 

i) TTAGGG t tag (DNA) 

25 3 1 AUCCCAAUC 5 1 (RNA) 

ii) TTAGggttag (DNA) 

31 AUCCCAAUC 5* ( RNA) 

30 In DNA, UC = primer, 1c = added nucleotides 

Thus, 4 nucleotides are added to the —TTAGGG primer (i) while 6 
nucleotides are added to the — TTAG primer (ii). The first repeat added by telomerase 
in a processive reaction is equivalent to this step; however, in a processive reaction 
35 telomerase performs a translocation step where the 3 f end is released and re-bound at 
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the y region of the template in a position sufficient to prime addition of another repeat 
(see Morin, 1997, Eur. 1 Cancer 33:750). 

A fully non-processive reaction produces only one band in a 
conventional assay using a single synthetic primer. Because this result could also be 
5 produced by other enzymes, such as a terminal transferase activity, it may be desirable 
in some applications to verify that the product is a result of a telomerase catalytic 
activity. A telomerase (comprising hTRT) generated band can be distinguished by 
several additional characteristics. The number of nucleotides added to the end of the 
primer should be consistent with the position of the primer 3' end. Thus, a --TTAGGG 

1 0 primer should have 4 nucleotides added and a — TTAG primer should have 6 

nucleotides added (see above). In practice, two or more sequence permuted primers can 
be used which have the same overall length but different 5' and 3' endpoints. As an 
illustrative example, the non-processive extension of primers 5- 
TTAGGGTTAGGGTTAGGG and S'-GTTAGGGTTAGGGTTAGG will generate 

1 5 products whose absolute length will be one nucleotide different (4 added to 5 - 
TTAGGGTTAGGGTTAGGG for a 22 nt total length, and 5 added to S'- 
GTTAGGGTTAGGGTTAGG for a 23 nt total length). The nucleotide dependence of 
the reaction should be consistent with the position of the primer terminus. Thus, a 
— TTAGGG primer product should require dG TP, TTP, and dATP, but not dCTP, and a 

20 — AGGGTT primer product should require dGTP and dATP, but not TTP or dCTP. 

The activity should be sensitive to RNAase or micrococcal nuclease pre-treatment (see 
Morin, 1989, Cell 59; 521) under conditions that will degrade hTR and so eliminate the 
template. 

25 b) Processive Activity 

In practice, a processive activity is easily observed by the 

appearance of a six nucleotide ladder in a conventional assay, TRAP assay, or gel-blot 

assay. A dot-blot assay can also be used, but no ladder is detected in such a method. 

The conventional assay is described in Morin, 1989, Cell 59:521, which is incorporated 
30 herein in its entirety and for all purposes. The TRAP assay is described in U.S. Patent 

No. 5,629,154; see also, PCT publication WO 97/15687, PCT publication WO 
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95/13381; Krupp et al. Nucleic Acids Res., 1997, 25: 919; and Wright et al., 1995, Nuc. 
Acids Res. 23:3794, each of which is incorporated herein in its entirety and for all 
purposes. The dot blot immunoassay is described in detail in co-pending U.S. Patent 
Application Serial Number 08/833,377, filed April 14, 1997, which is incorporated 

5 herein by reference in its entirety and for all purposes. The dot blot assay can be used 
in a format in which a non-processive activity, which does not add the 3 or more 
repeats required for stable hybridization of the (CCCUAA)n probe used to detect the 
activity, is tested with compounds or hTRT variants to determine if the same generates 
processivity, i.e., if the probe detects an expected telomerase substrate, then the 

1 0 compound or mutant is able to change the non-processive activity to a processive 

activity. Other assays for processive telomerase catalytic activity can also be used, e.g., 
the stretch PCR assay of Tatematsu et al., 1996, Oncogene 13:2265. The gel-blot 
assay, a combination of the conventional and dot blot assays can also be used. In this 
variation a conventional assay is performed with no radiolabeled nucleotide and with 

1 5 high dGTP concentrations (e.g., 0. 1-2 mM). After performing the conventional assay, 
the synthesized DNA is separated by denaturing PAGE and transferred to a membrane 
(e.g., nitrocellulose). Telomeric DNA (the product of telomerase - an extended 
telomerase primer or substrate) can then be detected by methods such as hybridization 
using labeled telomeric DNA probes (e.g., probes containing the CCCTAA sequence, 

20 as used in the dot blot assay, supra) An advantage of this technique is that it is more 
sensitive than the conventional assay and provides information about the size of the 
synthesized fragments and processivity of the reaction. 

c) Activity determinations 

25 The telomerase activity of an hTRT polypeptide can be determined 

using an unpurified, partially purified or substantially purified hTRT polypeptide (e.g., 
in association with hTR), in vitro, or after expression in vivo. For example, telomerase 
activity in a cell (e.g., a cell expressing a recombinant hTRT polypeptide of the 
invention) can be assayed by detecting an increase or decrease in the length of 

30 telomeres. Typically assays for telomerase catalytic activity are carried out using an 
hTRT complexed with hTR; however, alternative telomerase template RNAs may be 
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substituted, or one may conduct assays to measure another activity, such as telomerase- 
primer binding, Assays to determine the length of telomeres are known in the art and 
include hybridization of probes to telomeric DNA (an amplification step can be 
included) and TRF analysis i.e., the analysis of telomeric DNA restriction fragments 
5 [TRFs] following restriction endonuclease digestion, see PCT publications WO 

93/23572 and WO 96/41016; Counter et al., 1992, EMBOJ. 11:1921; Allsopp et al., 
1992, Proc. Natl Acad. Set USA 89:10114; Sanno, 1996, Am J Clin Pathol 106:16 
and Sanno, 1997, Neuroendocrinology 65:299. 

The telomerase catalytic activity of an hTRT polypeptide may be 

1 0 determined in a number of ways using the assays supra and other telomerase catalytic 
activity assays. According to one method, the hTRT protein is expressed (e.g., as 
described infra) in a telomerase negative human cell in which hTR is expressed (i.e., 
either normally in the cell or through recombinant expression), and the presence or 
absence of telomerase activity in the cell or cell lysate is determined. Examples of 

1 5 suitable telomerase-negative cells are IMR 90 (ATCC, #CCL-1 86) or BJ cells (human 
foreskin fibroblast line; see, e.g., Feng et al., 1995, Science 269:1236). Other 
examples include retinal pigmented epithelial cells (RPE), human umbilical vein 
endothelial cells (HUVEC; ATCC #CRL-1730), human aortic endothelial cells (HAEC; 
Clonetics Corp, #CC-2535), and human mammary epithelial cells (HME; Hammond et 

20 al, 1984, Proc. Natl Acad. Sci. USA 81:5435; Stampfer, 1985, J. Tissue Culture 
Methods 9:107). In an alternative embodiment, the hTRT polypeptide is expressed 
(e.g., by transfection with an hTRT expression vector) in a telomerase positive cell, and 
an increase in telomerase activity in the cell compared to an untransfected control cell is 
detected if the polypeptide has telomerase catalytic activity. Usually the telomerase 

25 catalytic activity in a cell transfected with a suitable expression vector expressing hTRT 
will be significantly increased, such as at least about 2-fold, at least about 5-fold, or 
even at least about 10-fold to 100-fold or even 1000-fold higher than in untransfected 
(control) cells. 

In an alternative embodiment, the hTRT protein is expressed in a cell 
30 (e.g., a telomerase negative cell in which hTR is expressed) as a fusion protein (see 

infra) having a label or an "epitope tag" to aid in purification. In one embodiment, the 
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RNP is recovered from the cell using an antibody that specifically recognizes the tag. 
Preferred tags are typically short or small and may include a cleavage site or other 
property that allows the tag to be removed from the hTRT polypeptide. Examples of 
suitable tags include the Xpress™ epitope (Invitrogen, Inc., San Diego CA), and other 
5 moieties that can be specifically bound by an antibody or nucleic acid or other 

equivalent method such as those described in Example 6. Alternative tags include 
those encoded by sequences inserted, e.g., into SEQUENCE ID NO: 1 upstream of the 
ATG codon that initiates translation of the protein of SEQUENCE ID NO: 2, which 
may include insertion of a (new) methionine initiation codon into the upstream 
10 sequence. 

It will be appreciated that when an hTRT variant is expressed in a cell 
(e.g., as a fusion protein) and subsequently isolated (e.g., as a ribonucleoprotein 
complex), other cell proteins (i.e., telomerase-associated proteins) may be associated 
with (directly or indirectly bound to) the isolated complex. In such cases, it will 
1 5 sometimes be desirable to assay telomerase activity for the complex containing hTRT, 
hTR and the associated proteins. 

2) OTHER TELOMERASE OR TRT PROTEIN ACTIVITIES 

The hTRT polypeptides of the invention include variants that lack 
20 telomerase catalytic activity but retain one or more other activities of telomerase. 

These other activities and the methods of the invention for measuring such activities 
include (but are not limited to) those discussed in the following sections, 
a) Conventional reverse transcriptase activity 

Telomerase conventional reverse transcriptase activity is described in, 
25 e.g., Morin, 1997, supra, and Spence et al., 1995, Science 267:988. Because hTRT 
contains conserved amino acid motifs that are required for reverse transcriptase 
catalytic activity, hTRT has the ability to transcribe certain exogenous (e.g., non-hTR) 
RNAs. A conventional RT assay measures the ability of the enzyme to transcribe an 
RNA template by extending an annealed DN A primer. Reverse transcriptase activity 
30 can be measured in numerous ways known in the art, for example, by monitoring the 

size increase of a labeled nucleic acid primer (e.g., RNA or DNA), or incorporation of a 
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labeled dNTP. See, e.g., Ausubel et al., supra. 

Because hTRT specifically associates with hTR, it can be appreciated 
that the DNA primer/RNA template for a conventional RT assay can be modified to 
have characteristics related to hTR and/or a telomeric DNA primer. For example, the 
5 RNA can have the sequence (CCCTAA) n , where n is at least 1 , or at least 3, or at least 
10 or more. In one embodiment, the (CCCTAA) n region is at or near the 5 1 terminus of 
the RNA (similar to the 5 ! locations of template regions in telomerase RNAs). 
Similarly, the DNA primer may have a 3' terminus that contains portions of the 
TTAGGG telomere sequence, for example X n TTAG, X n AGGG, X n (TTAGGG) q TTAG, 

10 etc., where X is a non-telomeric sequence and n is 8-20, or 6-30, and q is 1-4. In 

another embodiment, the DNA primer has a 5 ! terminus that is non-complementary to 
the RNA template, such that when the primer is annealed to the RNA, the 5 1 terminus of 
the primer remains unbound. Additional modifications of standard reverse transcription 
assays that may be applied to the methods of the invention are known in the art. 

15 b) Nucleolytic activity 

Telomerase nucleolytic activity is described in e.g., Morin, 1997, supra; 
Collins and Grieder, 1993, Genes and Development 7:1364. Telomerase possesses a 
nucleolytic activity (Joyce and Steitz, 1987, Trends Biochem. ScL 12:288); however, 
telomerase activity has defining characteristics. Telomerase preferentially removes 

20 nucleotides, usually only one, from the 3' end of an oligonucleotide when the 3' end of 
the DNA is positioned at the 5 r boundary of the DNA template sequence, in humans 
and Tetrahymena^ this nucleotide is the first G of the telomeric repeat (TTAGG in 
humans). Telomerase preferentially removes G residues but has nucleolytic activity 
against other nucleotides. This activity can be monitored. Two different methods are 

25 described here for illustrative purposes. One method involves a conventional 
telomerase reaction with a primer that binds the entire template sequence (i.e., 
terminating at the template boundary; 5-TAGGGATTAG in humans). Nucleolytic 
activity is observed by monitoring the replacement of the last dG residue with a 
radiolabeled dGTP provided in the assay. The replacement is monitored by the 

30 appearance of a band at the size of the starting primer as shown by gel electrophoresis 
and autoradiography. 
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A preferred method uses a DNA primer that has a "blocked" 3' terminus that 
cannot be extended by telomerase. The 3 -blocked primer can be used in a standard 
telomerase assay but will not be extended unless the 3' nucleotide is removed by the 
nucleolytic activity of telomerase. The advantage of this method is that telomerase 
5 activity can be monitored by any of several standard means, and the signal is strong and 
easy to quantify. The blocking of the 3 r terminus of the primer can be accomplished in 
several ways. One method is the addition of a 3'-deoxy-dNTP residue at the 3 1 terminus 
of the primer using standard oligonucleotide synthesis techniques. This terminus has a 
2' OH but not the 3 ? OH required for telomerase. Other means of blocking the 3' 

10 terminus exist, for instance, a 3 f dideoxy terminus, a 3-amine terminus, and others. An 
example of a primer for an hTRT nucleolytic assay is 5' -TTAGGGTTAGGGTTA 
(G 3U ) where the last residue denotes a 3'-deoxy-guanosine residue (Glen Research, 
Sterling, VA). Numerous other variations for a suitable primer based on the disclosure 
are known to those of skill in the art. 

15 c) Primer (telomere) binding activity 

Telomerase primer (telomere) binding activity is described in e.g., 
Morin, 1997, supra; Collins et al., 1995, Cell 81:677; Harrington et al, 1995, J. Biol. 
Chem. 270:8893. Telomerase is believed to have two sites which bind a telomeric 
DNA primer. The RT motifs associated with primer binding indicate hTRT and/or 

20 hTRT/hTR possesses DNA primer binding activity. There are several ways of assaying 
primer binding activity; however, a step common to most methods is incubation of a 
labeled DNA primer with hTRT or hTRT/hTR or other TRT/TR combinations under 
appropriate binding conditions. Also, most methods employ a means of separating 
unbound DNA from protein-bound DNA; those methods include the following. 

25 i) Gel-shift assays (also called electrophoretic/mobility shift assays) are 

those in which unbound DNA primer is separated from protein-bound DNA primer by 
electrophoresis on a nondenaturing gel (Ausubel et al., supra). 

ii) Matrix binding assays include several variations to the basic 
technique, which involves binding the hTRT or hTRT/hTR complex to a matrix (e.g., 

30 nitrocellulose), either before or after incubation with the labeled primer. By binding the 
hTRT to a matrix, the unbound primer can be mechanically separated from bound 
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primer. Residual unbound DNA can be removed by washing the membrane prior to 
quantitation. Those of skill recognize there are several means of coupling proteins to 
such matrices, solid supports, and membranes, including chemical, photochemical, UV 
cross-linking, antibody/epitope, and non-covalent (hydrophobic, electrostatic, etc.) 
interactions. 

The DNA primer can be any DNA with an affinity for telomerase, such 
as, for example, a telomeric DNA primer like (TTAGGG) n , where n could be 1-10 and 
is typically 3-5. The 3' and 5 f termini can end in any location of the repeat sequence. 
The primer can also have 5 f or 3' extensions of non-telomeric DNA that could facilitate 
labeling or detection. The primer can also be derivatized, e.g., to facilitate detection or 
isolation. 

d) dNTP binding activity 

Telomerase dNTP binding activity is described in e.g., Morin, 1997, 
supra; Spence et al., supra. Telomerase requires dNTPs to synthesize DNA. The 
hTRT protein has a nucleotide binding activity and can be assayed for dNTP binding in 
a manner similar to other nucleotide binding proteins (Kantrowitz et al., 1980, Trends 
Biochem. Set 5:124). Typically, binding of a labeled dNTP or dNTP analog can be 
monitored as is known in the art for non-telomerase RT proteins. 

e) RNA (i.e., hTR) binding activity 

Telomerase RNA (i.e., hTR) binding activity is described in e.g., Morin, 
1997, supra; Harrington et al., 1997, Science 275:973; Collins et al., 1995, Cell 81 :677. 
The RNA binding activity of a TRT protein of the invention may be assayed in a 
manner similar to the DNA primer binding assay described supra, using a labeled RNA 
probe. Methods for separating bound and unbound RNA and for detecting RNA are 
well known in the art and can be applied to the activity assays of the invention in a 
manner similar to that described for the DNA primer binding assay. The RNA can be 
full length hTR, fragments of hTR or other RNAs demonstrated to have an affinity for 
telomerase or hTRT. See U.S. Patent No. 5,583,016 and PCT Pub. No. 96/40868. 

3) TELOMERASE MOTIFS AS TARGETS 

The present invention, as noted supra, provides in addition to 

57 



# 



recombinant hTRT with a full complement (as described supra) of activities, hTRT 
polypeptides having less than the full complement of the telomerase activities of 
naturally occurring telomerase or hTRT or other TRT proteins. It will be appreciated 
that, in view of the disclosure herein of the RT and telomerase-specific motifs of TRT, 
alteration or mutation of conserved amino acid residues, such as are found in the motif 
sequences discussed supra, will result in loss-of activity mutants useful for therapeutic, 
drug screening and characterization, and other uses. For example, as described in 
Example 1, deletion of motifs B through D in the RT domains of the endogenous TRT 
gene in S. pombe resulted in haploid cells in which telomere progressively shortened to 
the point where hybridization of a telomere probe to telomeric repeats became almost 
undetectable, indicating a loss of telomerase catalytic activity. Similarly, alterations in 
the WxGxS site of motif E can affect telomerase DNA primer binding or function. 
Additionally, alterations of the amino acids in the motifs A, B', and C can affect the 
catalytic activity of telomerase. Mutation of the DD motif of hTRT can significantly 
reduce or abolish telomerase activity (see Example 16). 

C) SYNTHESIS OF hTRT AND OTHER TRT POLYPEPTIDES 

The invention provides a variety of methods for making the hTRT and 
other TRT polypeptides disclosed herein. In the following sections, chemical synthesis 
and recombinant expression of hTRT proteins, including fusion proteins, is described in 
some detail. 

1) CHEMICAL SYNTHESIS 

The invention provides hTRT polypeptides synthesized, entirely or in 
part, using general chemical methods well known in the art (see e.g., Caruthers et aL, 
1980, Nucleic Acids Res. Symp. Ser., 215-223; and Horn et aL, 1980, Nucleic Acids Res. 
Symp. Ser., 225-232). For example, peptide synthesis can be performed using various 
solid-phase techniques (Roberge, et aL, 1995, Science 269:202), including automated 
synthesis (e.g., using the Perkin Elmer ABI 431 A Peptide Synthesizer in accordance 
with the instructions provided by the manufacturer). When full length protein is 
desired, shorter polypeptides may be fused by condensation of the amino terminus of 
one molecule with the carboxyl terminus of the other molecule to form a peptide bond. 
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The newly synthesized peptide can be substantially purified, for 
example, by preparative high performance liquid chromatography (e.g., Creighton, 
Proteins, Structures and Molecular Principles, WH Freeman and Co, New York 
NY [1983]). The composition of the synthetic peptides (or any other peptides or 
5 polypeptides of the invention) may be confirmed by amino acid analysis or sequencing 
(e.g., the Edman degradation procedure; Creighton, supra). Importantly, the amino 
acid sequence of hTRT, or any part thereof, may be altered during direct synthesis 
and/or combined using chemical methods with sequences from other proteins or 
otherwise, or any part thereof or for any purpose, to produce a variant polypeptide of 
10 the invention. 

2) RECOMBINANT EXPRESSION OF hTRT AND OTHER TRT 
PROTEINS 

The present invention provides methods, reagents, vectors, and cells 

1 5 useful for expression of hTRT polypeptides and nucleic acids using in vitro (cell-free), 
ex vivo or in vivo (cell or organism-based) recombinant expression systems. In one 
embodiment, expression of the hTRT protein, or fragment thereof, comprises inserting 
the coding sequence into an appropriate expression vector {i.e., a vector that contains 
the necessary elements for the transcription and translation of the inserted coding 

20 sequence required for the expression system employed). Thus, in one aspect, the 

invention provides for a polynucleotide substantially identical in sequence to an hTRT 
gene coding sequence at least 25 nucleotides, and preferably for many applications 50 
to 100 nucleotides or more, of the hTRT cDNAs or genes of the invention, which is 
operably linked to a promoter to form a transcription unit capable of expressing an 

25 hTRT polypeptide. Methods well known to those skilled in the art can be used to 
construct the expression vectors containing an hTRT sequence and appropriate 
transcriptional or translational controls provided by the present invention (see, e.g., 
Sambrook et al., supra, Ausubel et al. supra, and this disclosure). 

The hTRT polypeptides provided by the invention include fusion 

30 proteins that contain hTRT polypeptides or fragments of the hTRT protein. The fusion 
proteins are typically produced by recombinant means, although they may also be made 
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by chemical synthesis. Fusion proteins can be useful in providing enhanced expression 
of the hTRT polypeptide constructs, or in producing hTRT polypeptides having other 
desirable properties, for example, comprising a label (such as an enzymatic reporter 
group), binding group, or antibody epitope. An exemplary fusion protein, comprising 
5 hTRT and enhanced green fluorescent protein (EGFP) sequences is described in 
Example 15, infra. It will be apparent to one of skill that the uses and applications 
discussed in Example 15 and elsewhere herein are not limited to the particular fusion 
protein, but are illustrative of the uses of various fusion constructs. 

The fusion protein systems of the invention can also be used to facilitate 

10 efficient production and isolation of hTRT proteins or peptides. For example, in some 
embodiments, the non-hTRT sequence portion of the fusion protein comprises a short 
peptide that can be specifically bound to an immobilized molecule such that the fusion 
protein can be separated from unbound components (such as unrelated proteins in a cell 
lysate). One example is a peptide sequence that is bound by a specific antibody. 

1 5 Another example is a peptide comprising polyhistidine tracts e.g. (His) 6 or 

histidine-tryptophan sequences that can be bound by a resin containing nickel or copper 
ions (i.e., metal-chelate affinity chromatography). Other examples include Protein A 
domains or fragments, which allow purification on immobilized immunoglobulin, and 
the domain utilized in the FLAGS extension/affinity purification system (Immunex 

20 Corp, Seattle WA). In some embodiments, the fusion protein includes a cleavage site 
so that the hTRT or other TRT polypeptide sequence can be easily separated from the 
non-hTRT peptide or protein sequence. In this case, cleavage may be chemical (e.g., 
cyanogen bromide, 2-(2-nitrophenylsulphenyl)-3-methyl-3 , -bromoindolene, 
hydroxylamine, or low pH) or enzymatic (e.g., Factor Xa, enterokinase). The choice of 

25 the fusion and cleavage systems may depend, in part, on the portion (i.e., sequence) of 
the hTRT polypeptide being expressed. Fusion proteins generally are described in 
Ausubel et al., supra, Ch. 16, Kroll et al., 1993, DNA Cell Biol 12:441, and the 
Invitrogen 1997 Catalog (Invitrogen Inc, San Diego CA). Other exemplary fusion 
proteins of the invention with epitope tags or tags and cleavage sites are provided in 

30 Example 6, infra. 

It will be appreciated by those of skill that, although the expression 
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systems discussed in this section are focused on expression of hTRT polypeptides, the 
same or similar cells, vectors and methods may be used to express hTRT 
polynucleotides of the invention, including sense and antisense polynucleotides without 
necessarily desiring production of hTRT polypeptides. Typically, expression of a 
polypeptide requires a suitable initiation codon (e.g., methionine), open reading frame, 
and translational regulatory signals (e.g., a ribosome binding site, a termination codon) 
which may be omitted when translation of a nucleic acid sequence to produce a protein 
is not desired. 

Expression of hTRT polypeptides and polynucleotides may be carried 
out to accomplish any of several related benefits provided by the present invention. 
One illustrative benefit is expression of hTRT polypeptides that are subsequently 
isolated from the cell in which they are expressed (for example for production of large 
amounts of hTRT for use as a vaccine or in screening applications to identify 
compounds that modulate telomerase activity). A second illustrative benefit is 
expression of hTRT in a cell to change the phenotype of the cell (as in gene therapy 
applications). Nonmammalian cells can be used for expression of hTRT for 
purification, while eukaryotic especially mammalian cells (e.g., human cells) can be 
used not only for isolation and purification of hTRT but also for expression of hTRT 
when a change in phenotype in a cell is desired (e.g., to effect a change in proliferative 
capacity as in gene therapy applications). By way of illustration and not limitation, 
hTRT polypeptides having one or more telomerase activities (e.g. telomerase catalytic 
activity) can be expressed in a host cell to increase the proliferative capacity of a cell 
(e.g., immortalize a cell) and, conversely, hTRT antisense polynucleotides or inhibitory 
polypeptides typically can be expressed to reduce the proliferative capacity of a cell 
(e.g., of a telomerase positive malignant tumor cell). Numerous specific applications 
are described herein, e.g., in the discussion of uses of the reagents and methods of the 
invention for therapeutic applications, below. 

Illustrative useful expression systems (cells, regulatory elements, vectors 
and expression) of the present invention include a number of cell-free systems such as 
reticulocyte lysate and wheat germ systems using hTRT polynucleotides in accordance 
with general methods well known in the art (see, e.g., Ausubel et al. supra at Ch. 10). 
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In alternative embodiments, the invention provides reagents and methods for expressing 
hTRT in prokaryotic or eukaryotic cells. Thus, the present invention provides nucleic 
acids encoding hTRT polynucleotides, proteins, protein subsequences, or fusion 
proteins that can be expressed in bacteria, fungi, plant, insect, and animal, including 
5 human cell expression systems known in the art, including isolated cells, cell lines, cell 
cultures, tissues, and whole organisms. As will be understood by those of skill, the 
hTRT polynucleotides introduced into a host cell or cell free expression system will 
usually be operably linked to appropriate expression control sequences for each host or 
cell free system. 

10 Useful bacterial expression systems include E. coli, bacilli (such as 

Bacillus subtilus), other enterobacteriaceae (such as Salmonella,, Serratia, and various 
Pseudomonas species) or other bacterial hosts (e.g., Streptococcus cremoris, 
Streptococcus lactis, Streptococcus thermophilus, Leuconostoc citrovorum, 
Leuconostoc mesenteroides, Lactobacillus acidophilus, Lactobacillus lactis, 

1 5 Bifidobacterium bifidum, Bifidobacteriu breve, and Bifidobacterium longum). The 

hTRT expression constructs useful in prokaryotes include recombinant bacteriophage, 
plasmid or cosmid DNA expression vectors, or the like, and typically include promoter 
sequences. Illustrative promoters include inducible promoters, such as the lac 
promoter, the hybrid lacZ promoter of the Bluescript7 phagemid [Stratagene, La Jolla 

20 CA] or pSportl [Gibco BRL]; phage lambda promoter systems; a tryptophan (trp) 
promoter system; and ptrp-lac hybrids and the like. Bacterial expression constructs 
optionally include a ribosome binding site and transcription termination signal 
regulatory sequences. Illustrative examples of specific vectors useful for expression 
include, for example, pTrcHis2, (Invitrogen, San Diego CA), pThioHis A, B & C, and 

25 numerous others known in the art or that may be developed (see, e.g. Ausubel). Useful 
vectors for bacteria include those that facilitate production of hTRT- fusion proteins. 
Useful vectors for high level expression of fusion proteins in bacterial cells include, but 
are not limited to, the multifunctional E. coli cloning and expression vectors such as 
Bluescript7 (Stratagene), noted above, in which the sequence encoding hTRT protein, 

30 an hTRT fusion protein or an hTRT fragment may be ligated into the vector in-frame 
with sequences for the amino-terminal Met and the subsequent 7 residues of 
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P-galactosidase so that a hybrid protein is produced (e.g., pIN vectors; Van Heeke and 
Schuster, 1989, J. Biol Chem., 264:5503). Vectors such as pGEX vectors (e.g., pGEX- 
2TK; Pharmacia Biotech) may also be used to express foreign polypeptides, such as 
hTRT protein, as fusion proteins with glutathione S-transferase (GST). Such fusion 
5 proteins may be purified from lysed cells by adsorption to glutathione-agarose beads 
followed by elution in the presence of free glutathione. Proteins made in such systems 
often include enterokinase, thrombin or factor Xa protease cleavage sites so that the 
cloned polypeptide of interest can be released from the GST moiety at will, as may be 
useful in purification or other applications. Other examples are fusion proteins 

1 0 comprising hTRT and the E. coli Maltose Binding Protein (MBP) or E. Coli 

thioredoxin. Illustrative examples of hTRT expression constructs useful in bacterial 
cells are provided in Example 6, infra. 

The invention further provides hTRT polypeptides expressed in fungal 
systems, such as Dictyostelium and, preferably, yeast, such as Saccharomyces 

15 cerevisiae, Pichia pastoris, Torulopsis holmil, Saccharomyces fragilis, Saccharomyces 
lactis, Hansenula polymorpha and Candida pseudotropicalis. When hTRT is 
expressed in yeast, a number of suitable vectors are available, including plasmid and 
yeast artificial chromosomes (YACs) vectors. The vectors typically include expression 
control sequences, such as constitutive or inducible promoters (e.g., such as alpha 

20 factor, alcohol oxidase, PGH, and 3-phosphoglycerate kinase or other glycolytic 

enzymes), and an origin of replication, termination sequences and the like, as desired. 
Suitable vectors for use in Pichia include pPICZ, His6/pPICZB, pPICZalpha, 
pPIC3.5K, pPIC9K, pA0815, pGAP2A, B & C, pGAP2alpha A, B, and C (Invitrogen, 
San Diego, CA) and numerous others known in the art or to be developed. In one 

25 embodiment, the vector His6/pPICZB (Invitrogen, San Diego, CA) is used to express a 
His 6 -hTRT fusion protein in the yeast Pichia pastoris. An example of a vector useful in 
Saccharomyces is pYES2 (Invitrogen, San Diego, CA). Illustrative examples of hTRT 
expression constructs useful in yeast are provided in Example 6, infra. 

The hTRT polypeptides of the invention may also be expressed in plant 

30 cell systems transfected with plant or plant virus expression vectors (e.g., cauliflower 
mosaic virus, CaMV ; tobacco mosaic virus, TMV) or transformed with bacterial 
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expression vectors (e.g., Ti or pBR322 plasmid). In cases where plant virus expression 
vectors are used, the expression of an hTRT-encoding sequence may be driven by any 
of a number of promoters. For example, viral promoters such as the 35S and 19S 
promoters of CaMV (Brisson et aL, 1984, Nature 310:51 1-514) may be used alone or in 
combination with the omega leader sequence from TMV (Takamatsu et aL, 1987, 
EMBO J., 6:307-3 1 1). Alternatively, plant promoters such as that from the small 
subunit gene of RUBISCO (Coruzzi et aL, 1984, EMBO J., 3:1671-1680; Broglie et aL, 
1984, Science 224:838-843) or heat shock promoters (Winter and Sinibaldi, 1991, 
Results Probl. Cell Differ., 17:85), or storage protein gene promoters may be used. 
These constructs can be introduced into plant cells by direct DNA transformation or 
pathogen-mediated transfection (for reviews of such techniques, see Hobbs or Murry, 
1992, in McGraw Hill Yearbook of Science and Technology McGraw Hill New 
York NY, pp. 191-196 [1992]; or Weissbach and Weissbach, 1988, Methods for 
Plant Molecular Biology, Academic Press, New York NY, pp. 421-463). 

Another expression system provided by the invention for expression of 
hTRT protein is an insect system. A preferred system uses a baculovirus polyhedrin 
promoter. In one such system, Autographa californica nuclear polyhedrosis virus 
(AcNPV) is used as a vector to express foreign genes in Spodopterafrugiperda cells or 
in Trichoplusia larvae. The sequence encoding the gene of interest may be cloned into 
a nonessential region of the virus, such as the polyhedrin gene, and placed under control 
of the polyhedrin promoter. Successful insertion of the sequence, e.g., encoding the 
hTRT protein, will render the polyhedrin gene inactive and produce recombinant virus 
lacking coat protein. The recombinant viruses are then used to infect S. frugiperda 
cells or Trichoplusia larvae, in which the hTRT sequence is then expressed (see, for 
general methods, Smith et aL, J. Virol., 46:584 [1983]; Engelhard et aL, Proc. Natl. 
Acad. Sci. 91:3224-7 [1994]). Useful vectors for baculovirus expression include 
pBlueBacHis2 A, B & C, pBlueBac4.5, pMelBacB and numerous others known in the 
art or to be developed. Illustrative examples of hTRT expression constructs useful in 
insect cells are provided in Example 6, infra. 

The present invention also provides expression systems in mammals and 
mammalian cells. As noted supra, hTRT polynucleotides may be expressed in 
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mammalian cells (e.g., human cells) for production of significant quantities of hTRT 
polypeptides (e.g., for purification) or to change the phenotype of a target cell (e.g., for 
purposes of gene therapy, cell immortalization, or other). In the latter case, the hTRT 
polynucleotide expressed may or may not encode a polypeptide with a telomerase 
5 catalytic activity. That is, expression may be of a sense or antisense polynucleotide, an 
inhibitory or stimulatory polypeptide, a polypeptide with zero, one or more telomerase 
activities, and other combinations and variants disclosed herein or apparent to one of 
skill upon review of this disclosure. 

Suitable mammalian host tissue culture cells for expressing the nucleic 

10 acids of the invention include any normal mortal or normal or abnormal immortal 
animal or human cell, including: monkey kidney CV1 line transformed by SV40 
(COS-7, ATCC CRL 1651); human embryonic kidney line (293; Graham et al, J. Gen. 
Virol 36:59 (1977)); baby hamster kidney cells (BHK, ATCC CCL 10); CHO (ATCC 
CCL 61 and CRL 9618); mouse Sertoli cells (TM4, Mather, Biol Reprod. 23:243-251 

15 (1980)); monkey kidney cells (CV1 ATCC CCL 70); African green monkey kidney 
cells (VERO-76, ATCC CRL 1587); human cervical carcinoma cells (HeLa, ATCC 
CCL 2); canine kidney cells (MDCK, ATCC CCL 34); buffalo rat liver cells (BRL 3 A, 
ATCC CRL 1442); human lung cells (W138, ATCC CCL 75); human liver cells (Hep 
G2, HB 8065); mouse mammary tumor (MMT 060562, ATCC CCL51); TRI cells 

20 (Mather, et al;, Annals MY. Acad Sci. 383:44-46 (1982); MDCK cells (ATCC CCL 34 
and CRL 6253); HEK 293 cells (ATCC CRL 1573); and WI-38 cells (ATCC CCL 75; 
ATCC: American Type Culture Collection, Rockville, MD). The use of mammalian 
tissue cell culture to express polypeptides is discussed generally in Winnacker, From 
Genes to Clones (VCH Publishers, N.Y., N.Y., 1987). 

25 For mammalian host cells, viral-based and nonviral expression systems 

are provided. Nonviral vectors and systems include plasmids and episomal vectors, 
typically with an expression cassette for expressing a protein or RNA, and human 
artificial chromosomes (see, e.g., Harrington et al., 1997, Nat Genet 15:345). For 
example, nonviral vectors useful for expression of hTRT polynucleotides and 

30 polypeptides in mammalian (e.g., human) cells include pcDNA3.1/His, pEBVHis A, B 
& C, (Invitrogen, San Diego CA), MPSV vectors, others described in the Invitrogen 
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1997 Catalog (Invitrogen Inc, San Diego CA), which is incorporated in its entirety 
herein, and numerous others known in the art for other proteins. Illustrative examples 
of hTRT expression constructs useful in mammalian cells are provided in Example 6, 
infra. 



adenoassociated viruses, herpes viruses, vectors based on SV40, papilloma virus, HBP 
Epstein Barr virus, vaccinia virus vectors and Semliki Forest virus (SFV). SFV and 
vaccinia vectors are discussed generally in Ausubel et al., supra, Ch 16. These vectors 
are often made up of two components, a modified viral genome and a coat structure 

10 surrounding it {see generally Smith, 1995, Annu. Rev. Microbiol 49: 807), although 

sometimes viral vectors are introduced in naked form or coated with proteins other than 
viral proteins. However, the viral nucleic acid in a vector may be changed in many 
ways, for example, when designed for gene therapy. The goals of these changes are to 
disable growth of the virus in target cells while maintaining its ability to grow in vector 

1 5 form in available packaging or helper cells, to provide space within the viral genome 
for insertion of exogenous DNA sequences, and to incorporate new sequences that 
encode and enable appropriate expression of the gene of interest. Thus, vector nucleic 
acids generally comprise two components: essential cis-acting viral sequences for 
replication and packaging in a helper line and the transcription unit for the exogenous 

20 gene. Other viral functions are expressed in trans in a specific packaging or helper cell 
line. Adenoviral vectors (e.g., for use in human gene therapy) are described in, e.g., 
Rosenfeld et al., 1992, Cell 68: 143; PCT publications WO 94/12650; 94/12649; and 
94/12629. In cases where an adenovirus is used as an expression vector, a sequence 
encoding hTRT may be ligated into an adenovirus transcription/translation complex 

25 consisting of the late promoter and tripartite leader sequence. Insertion in a 

nonessential El or E3 region of the viral genome will result in a viable virus capable of 
expressing in infected host cells (Logan and Shenk, 1984, Proc. Natl Acad. ScL, 
81 :3655). Replication-defective retroviral vectors harboring a therapeutic 
polynucleotide sequence as part of the retroviral genome are described in, e.g., Miller et 



30 al., 1990, Mol Cell Biol 10: 4239; Kolberg, 1992, J. NIHRes. 4: 43; and Cornetta et 
al., 1991, Hum. Gene Ther. 2: 215. 



5 



Useful viral vectors include vectors based on retroviruses, adenoviruses, 
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In mammalian cell systems, promoters from mammalian genes or from 
mammalian viruses are often appropriate. Suitable promoters may be constitutive, cell 
type-specific, stage-specific, and/or modulatable or regulatable (e.g., by hormones such 
as glucocorticoids). Useful promoters include, but are not limited to, the 
5 metallothionein promoter, the constitutive adenovirus major late promoter, the 
dexamethasone-inducible MMTV promoter, the SV40 promoter, the MRP polIII 
promoter, the constitutive MPSV promoter, the tetracycline-inducible CMV promoter 
(such as the human immediate-early CMV promoter), the constitutive CMV promoter, 
and promoter-enhancer combinations known in the art. 
1 o Other regulatory elements may also be required or desired for efficient 

expression of an hTRT polynucleotide and/or translation of a sequence encoding hTRT 
proteins. For translation, these elements typically include an ATG initiation codon and 
adjacent ribosome binding site or other sequences. For sequences encoding the hTRT 
protein, provided its initiation codon and upstream promoter sequences are inserted into 
1 5 an expression vector, no additional translational or other control signals may be needed. 
However, in cases where only coding sequence, or a portion thereof, is inserted, 
exogenous transcriptional and/or translational control signals (e.g., the promoter, 
ribosome-binding site, and ATG initiation codon) must often be provided. 
Furthermore, the initiation codon must typically be in the correct reading frame to 
20 ensure translation of the desired protein. Exogenous transcriptional elements and 

initiation codons can be of various origins, both natural and synthetic. In addition, the 
efficiency of expression may be enhanced by the inclusion of enhancers appropriate to 
the cell system in use (Scharf et al., 1994, Results Probl Cell Differ. 20:125;and 
Bittner et al. 1987, Meth. EnzymoL, 153:516). For example, the SV40 enhancer or 
25 CMV enhancer may be used to increase expression in mammalian host cells. 

Expression of hTRT gene products can also by effected (increased) by 
activation of an hTRT promoter or, enhancer in a cell such as a human cell, e.g., a 
telomerase-negative cell line. Activation can be carried out in a variety of ways, 
including administration of an exogenous promoter activating agent, or inhibition of a 
30 cellular component that suppresses expression of the hTRT gene. It will be appreciated 
that, conversely, inhibition of promoter function, as described infra, will reduce hTRT 



67 



# # 

gene expression. 

The invention provides inducible and repressible expression of hTRT 
polypeptides using such system as the Ecdysone-Inducible Expression System 
(Invitrogen), and the Tet-On and Tet-off tetracycline regulated systems from Clontech. 
5 The ecdysone-inducible expression system uses the steroid hormone ecdysone analog, 
muristerone A, to activate expression of a recombinant protein via a heterodimeric 
nuclear receptor (No et al., 1996, Proc. Natl. Acad. Set USA 93:3346). In one 
embodiment of the invention, hTRT is cloned in the pIND vector (Clontech), which 
contains five modified ecdysone response elements (E/GREs) upstream of a minimal 

10 heat shock promoter and the multiple cloning site. The construct is then transfected in 
cell lines stably expressing the ecdysone receptor. After transfection, cells are treated 
with muristerone A to induce intracellular expression from pIND. In another 
embodiment of the invention, hTRT polypeptide is expressed using the Tet-on and 
Tet-off expression systems (Clontech) to provide regulated, high-level gene expression 

15 (Gossen et al., 1992, Proc. Natl Acad Set USA 89:5547; Gossen et al., 1995, Science 
268:1766). 

The hTRT vectors of the invention may be introduced into a cell, tissue, 
organ, patient or animal by a variety of methods. The nucleic acid expression vectors 
(typically dsDNA) of the invention can be transferred into the chosen host cell by 

20 well-known methods such as calcium chloride transformation (for bacterial systems), 
electroporation, calcium phosphate treatment, liposome-mediated transformation, 
injection and microinjection, ballistic methods, virosomes, immunoliposomes, 
polycation:nucleic acid conjugates, naked DNA, artificial virions, fusion to the herpes 
virus structural protein VP22 (Elliot and O'Hare, Cell 88:223), agent-enhanced uptake 

25 of DNA, and ex vivo transduction. Useful liposome-mediated DNA transfer methods 
are described in US Patent Nos. 5,049,386, US 4,946,787; and US 4,897,355; PCT 
publications WO 91/17424, WO 91/16024; Wang and Huang, 1987, Biochem. Biophys. 
Res. Commun. 147: 980; Wang and Huang, 1989, Biochemistry 28: 9508; Litzinger and 
Huang, 1992, Biochem. Biophys, Acta 1 1 13:201; Gao and Huang, 1991, Biochem. 

30 Biophys. Res. Commun. 179: 280. Immunoliposomes have been described as carriers 
of exogenous polynucleotides (Wang and Huang, 1987, Proc. Natl. Acad. Set U.S.A. 
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84:785 1 ; Trubetskoy et al., 1 992, Biochem. Biophys. Acta 1 1 3 1 :3 1 1) and may have 
improved cell type specificity as compared to liposomes by virtue of the inclusion of 
specific antibodies which presumably bind to surface antigens on specific cell types. 
Behr et al., 1989, Proc. Natl Acad Set U.S.A. 86:6982 report using lipopolyamine as a 
reagent to mediate transfection itself, without the necessity of any additional 
phospholipid to form liposomes. Suitable delivery methods will be selected by 
practitioners in view of acceptable practices and regulatory requirements (e.g., for gene 
therapy or production of cell lines for expression of recombinant proteins). It will be 
appreciated that the delivery methods listed above may be used for transfer of nucleic 
acids into cells for purposes of gene therapy, transfer into tissue culture cells, and the 
like. 

For long-term, high-yield production of recombinant proteins, stable 
expression will often be desired. For example, cell lines which stably express hTRT 
can be prepared using expression vectors of the invention which contain viral origins of 
replication or endogenous expression elements and a selectable marker gene. 
Following the introduction of the vector, cells may be allowed to grow for 1-2 days in 
an enriched media before they are switched to selective media. The purpose of the 
selectable marker is to confer resistance to selection, and its presence allows growth of 
cells which successfully express the introduced sequences in selective media. 
Resistant, stably transfected cells can be proliferated using tissue culture techniques 
appropriate to the cell type. An amplification step, e.g., by administration of 
methyltrexate to cells transfected with a DHFR gene according to methods well known 
in the art, can be included. 

In addition, a host cell strain may be chosen for its ability to modulate 
the expression of the inserted sequences or to process the expressed protein in the 
desired fashion. Such modifications of the polypeptide include, but are not limited to, 
acetylation, carboxylation, phosphorylation, lipidation and acylation. Post-translational 
processing may also be important for correct insertion, folding and/or function. 
Different host cells have cellular machinery and characteristic mechanisms specific for 
each cell for such post-translational activities and so a particular cell may be chosen to 
ensure the correct modification and processing of the introduced, foreign protein. 
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As noted supra, when expressing an hTRT. protein (including variants) 
in cells or organisms it is sometimes desirable to use an hTRT protein-encoding 
polynucleotide that employs a codon distribution other than that found in a naturally 
occurring hTRT gene. hTRT protein-encoding polynucleotides with alternative codons 
throughout, or at specific sites, in the coding sequence are used to optimize (e.g., 
increase) expression of the hTRT protein in cells, especially non-human cells (e.g., 
bacterial, plant, fungal, and non-human animal cells) which have different preferential 
codon usage than human cells. Codon changes may also be used to facilitate 
manipulation of the hTRT polynucleotide (e.g., by engineering useful tags or restriction 
sites into the coding sequence), and for other reasons. When the goal is to optimize 
expression (e.g., by increasing translational efficiency), tables of preferred codon usage, 
which are publicly available and are well known to those of skill, are used to design a 
suitable polynucleotide by "reverse translation" of the desired (e.g., hTRT) amino acid 
sequence. Alternatively, preferred codon usage can be determined for a particular 
organism (e.g., Pichia pastoris) or class of genes (e.g., highly expressed genes of a 
particular organism) by comparison of published gene sequences for the target 
organism or gene class. 

Illustrative hTRT-encoding polynucleotide sequences are provided in 
Table 9 (A-E), infra. All of the sequences in Table 9 are in the 5'->3\ Table 9A shows 
an hTRT protein encoding polynucleotide that uses a codon distribution preferentially 
employed in the bacterium E. coli. Table 9B shows a second polynucleotide sequence 
particularly useful for expression in E. coli (and other enteric bacteria) using codons 
preferentially used in highly expressed genes in enteric bacteria. Table 4C shows an 
hTRT protein encoding polynucleotide that uses a codon distribution preferentially 
employed in yeast (i.e., S. cerevisiae). Table 4D shows an hTRT protein encoding 
polynucleotide that uses a codon distribution preferentially used in highly expressed 
genes in yeast. Table 4E shows an hTRT protein encoding polynucleotide that uses a 
"generic" codon distribution that should be efficiently expressed in both bacteria (e.g., 
E. coli) and yeast (e.g., S. pombe, S. cerevisiae, P. pastoris) and some insect (e.g., S. 
frugiperda) cells. Such "generic" polynucleotide sequences (optimized for more than 
one organism) are useful for, for example, comparative studies, screening in different 
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organisms of hTRT binding or modulatory agents, creation of shuttle vectors, and other 
uses. In this "generic" sequence, the codon TCT (serine) may not be optimal for 
expression in Drosophila cells. Therefore, in an alternative embodiment the sequence 
in Table 4E is modified to replace TCT with TCC for efficient expression in 
Drosophila as well as bacteria and yeast. 



TABLE 9 

hTRT-ENCODING POLYNUCLEOTIDE SEQUENCES EMPLOYING 
AT TFKNATTVE CODON DISTRIBUTIONS 

Table 9A 
R colt fall genes) 

ATG CCG CGC GCG CCG CGC TGC CGC GCG GTG CGC AGC CTG CTG CGC AGC CAT TAT 
CGC GAA GTG CTG CCG CTG GCG ACC TTT GTG CGC CGC CTG GGC CCG CAG GGC TGG 
CGC CTG GTG CAG CGC GGC GAT CCG GCG GCG TTT CGC GCG CTG GTG GCG CAG TGC 
CTG GTG TGC GTG CCG TGG GAT GCG CGC CCG CCG CCG GCG GCG CCG AGC TTT CGC 
CAG GTG AGC TGC CTG AAA GAA CTG GTG GCG CGC GTG CTG CAG CGC CTG TGC GAA 
CGC GGC GCG AAA AAC GTG CTG GCG TTT GGC TTT GCG CTG CTG GAT GGC GCG CGC 
GGC GGC CCG CCG GAA GCG TTT ACC ACC AGC GTG CGC AGC TAT CTG CCG AAC ACC 
GTG ACC GAT GCG CTG CGC GGC AGC GGC GCG TGG GGC CTG CTG CTG CGC CGC GTG 
GGC GAT GAT GTG CTG GTG CAT CTG CTG GCG CGC TGC GCG CTG TTT GTG CTG GTG 
GCG CCG AGC TGC GCG TAT CAG GTG TGC GGC CCG CCG CTG TAT CAG CTG GGC GCG 
GCG ACC CAG GCG CGC CCG CCG CCG CAT GCG AGC GGC CCG CGC CGC CGC CTG GGC 
TGC GAA CGC GCG TGG AAC CAT AGC GTG CGC GAA GCG GGC GTG CCG CTG GGC CTG 
CCG GCG CCG GGC GCG CGC CGC CGC GGC GGC AGC GCG AGC CGC AGC CTG CCG CTG 
CCG AAA CGC CCG CGC CGC GGC GCG GCG CCG GAA CCG GAA CGC ACC CCG GTG GGC 
CAG GGC AGC TGG GCG CAT CCG GGC CGC ACC CGC GGC CCG AGC GAT CGC GGC TTT 
TGC GTG GTG AGC CCG GCG CGC CCG GCG GAA GAA GCG ACC AGC CTG GAA GGC GCG 
CTG AGC GGC ACC CGC CAT AGC CAT CCG AGC GTG GGC CGC CAG CAT CAT GCG GGC 
CCG CCG AGC ACC AGC CGC CCG CCG CGC CCG TGG GAT ACC CCG TGC CCG CCG GTG 
TAT GCG GAA ACC AAA CAT TTT CTG TAT AGC AGC GGC GAT AAA GAA CAG CTG CGC 
CCG AGC TTT CTG CTG AGC AGC CTG CGC CCG AGC CTG ACC GGC GCG CGC CGC CTG 
GTG GAA ACC ATT TTT CTG GGC AGC CGC CCG TGG ATG CCG GGC ACC CCG CGC CGC 
CTG CCG CGC CTG CCG CAG CGC TAT TGG CAG ATG CGC CCG CTG TTT CTG GAA CTG 
CTG GGC AAC CAT GCG CAG TGC CCG TAT GGC GTG CTG CTG AAA ACC CAT TGC CCG 
CTG CGC GCG GCG GTG ACC CCG GCG GCG GGC GTG TGC GCG CGC GAA AAA CCG CAG 
GGC AGC GTG GCG GCG CCG GAA GAA GAA GAT ACC GAT CCG CGC CGC CTG GTG CAG 
CTG CTG CGC CAG CAT AGC AGC CCG TGG CAG GTG TAT GGC TTT GTG CGC GCG TGC 
CTG CGC CGC CTG GTG CCG CCG GGC CTG TGG GGC AGC CGC CAT AAC GAA CGC CGC 
TTT CTG CGC AAC ACC AAA AAA TTT ATT AGC CTG GGC AAA CAT GCG AAA CTG AGC 
CTG CAG GAA CTG ACC TGG AAA ATG AGC GTG CGC GAT TGC GCG TGG CTG CGC CGC 
AGC CCG GGC GTG GGC TGC GTG CCG GCG GCG GAA CAT CGC CTG CGC GAA GAA ATT 
CTG GCG AAA TTT CTG CAT TGG CTG ATG AGC GTG TAT GTG GTG GAA CTG CTG CGC 
AGC TTT TTT TAT GTG ACC GAA ACC ACC TTT CAG AAA AAC CGC CTG TTT TTT TAT 
CGC AAA AGC GTG TGG AGC AAA CTG CAG AGC ATT GGC ATT CGC CAG CAT CTG AAA 
CGC GTG CAG CTG CGC GAA CTG AGC GAA GCG GAA GTG CGC CAG CAT CGC GAA GCG 
CGC CCG GCG CTG CTG ACC AGC CGC CTG CGC TTT ATT CCG AAA CCG GAT GGC CTG 
CGC CCG ATT GTG AAC ATG GAT TAT GTG GTG GGC GCG CGC ACC TTT CGC CGC GAA 
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AAA CGC GCG GAA CGC CTG ACC AGC CGC 
TAT GAA CGC GCG CGC CGC CCG GGC CTG 
GAT ATT CAT CGC GCG TGG CGC ACC TTT 
CCG CCG GAA CTG TAT TTT GTG AAA GTG 
CCG CAG GAT CGC CTG ACC GAA GTG ATT 
TAT TGC GTG CGC CGC TAT GCG GTG GTG 
AAA GCG TTT AAA AGC CAT GTG AGC ACC 
CAG TTT GTG GCG CAT CTG CAG GAA ACC 
GAA CAG AGC AGC AGC CTG AAC GAA GCG 
CGC TTT ATG TGC CAT CAT GCG GTG CGC 
CAG GGC ATT CCG CAG GGC AGC ATT CTG 
GGC GAT ATG GAA AAC AAA CTG TTT GCG 
CGC CTG GTG GAT GAT TTT CTG CTG GTG 
TTT CTG CGC ACC CTG GTG CGC GGC GTG 
CGC AAA ACC GTG GTG AAC TTT CCG GTG 
TTT GTG CAG ATG CCG GCG CAT GGC CTG 
ACC CGC ACC CTG GAA GTG CAG AGC GAT 
CGC GCG AGC CTG ACC TTT AAC CGC GGC 
AAA CTG TTT GGC GTG CTG CGC CTG AAA 
GTG AAC AGC CTG CAG ACC GTG TGC ACC 
GCG TAT CGC TTT CAT GCG TGC GTG CTG 
AAA AAC CCG ACC TTT TTT CTG CGC GTG 
AGC ATT CTG AAA GCG AAA AAC GCG GGC 
GGC CCG CTG CCG AGC GAA GCG GTG CAG 
AAA CTG ACC CGC CAT CGC GTG ACC TAT 
GCG CAG ACC CAG CTG AGC CGC AAA CTG 
GCG GCG GCG AAC CCG GCG CTG CCG AGC 



GTG AAA GCG CTG TTT AGC GTG CTG AAC 
CTG GGC GCG AGC GTG CTG GGC CTG GAT 
GTG CTG CGC GTG CGC GCG CAG GAT CCG 
GAT GTG ACC GGC GCG TAT GAT ACC ATT 
GCG AGC ATT ATT AAA CCG CAG AAC ACC 
CAG AAA GCG GCG CAT GGC CAT GTG CGC 
CTG ACC GAT CTG CAG CCG TAT ATG CGC 
AGC CCG CTG CGC GAT GCG GTG GTG ATT 
AGC AGC GGC CTG TTT GAT GTG TTT CTG 
ATT CGC GGC AAA AGC TAT GTG CAG TGC 
AGC ACC CTG CTG TGC AGC CTG TGC TAT 
GGC ATT CGC CGC GAT GGC CTG CTG CTG 
ACC CCG CAT CTG ACC CAT GCG AAA ACC 
CCG GAA TAT GGC TGC GTG GTG AAC CTG 
GAA GAT GAA GCG CTG GGC GGC ACC GCG 
TTT CCG TGG TGC GGC CTG CTG CTG GAT 
TAT AGC AGC TAT GCG CGC ACC AGC ATT 
TTT AAA GCG GGC CGC AAC ATG CGC CGC 
TGC CAT AGC CTG TTT CTG GAT CTG CAG 
AAC ATT TAT AAA ATT CTG CTG CTG CAG 
CAG CTG CCG TTT CAT CAG CAG GTG TGG 
ATT AGC GAT ACC GCG AGC CTG TGC TAT 
ATG AGC CTG GGC GCG AAA GGC GCG GCG 
TGG CTG TGC CAT CAG GCG TTT CTG CTG 
GTG CCG CTG CTG GGC AGC CTG CGC ACC 
CCG GGC ACC ACC CTG ACC GCG CTG GAA 
GAT TTT AAA ACC ATT CTG GAT 



Table 9B 

Fnteric Bacteria ffliffh Expressing Genes) 

1 ATGCCGCGTG CTCCGCGTTG CCGTGCTGTT CGTTCCCTGC TGCGTTCCCA 

51 CTACCGTGAA GTTCTGCCGC TGGCTACCTT CGTTCGTCGT CTGGGTCCGC 

101 AGGGTTGGCG TCTGGTTCAG CGTGGTGACC CGGCTGCTTT CCGTGCTCTG 

151 GTTGCTCAGT GCCTGGTTTG CGTTCCGTGG GACGCTCGTC CGCCGCCGGC 

201 TGCTCCGTCC TTCCGTCAGG TTTCCTGCCT GAAAGAACTG GTTGCTCGTG 

2 51 TTCTGCAGCG TCTGTGCGAA CGTGGTGCTA AAAACGTTCT GGCTTTCGGT 

301 TTCGCTCTGC TGGACGGTGC TCGTGGTGGT CCGCCGGAAG CTTTCACCAC 

351 CTCCGTTCGT TCCTACCTGC CGAACACCGT TACCGACGCT CTGCGTGGTT 

401 CCGGTGCTTG GGGTCTGCTG CTGCGTCGTG TTGGTGACGA CGTTCTGGTT 

451 CACCTGCTGG CTCGTTGCGC TCTGTTCGTT CTGGTTGCTC CGTCCTGCGC 

501 TTACCAGGTT TGCGGTCCGC CGCTGTACCA GCTGGGTGCT GCTACCCAGG 

551 CTCGTCCGCC GCCGCACGCT TCCGGTCCGC GTCGTCGTCT GGGTTGCGAA 

601 CGTGCTTGGA ACCACTCCGT TCGTGAAGCT GGTGTTCCGC TGGGTCTGCC 

651 GGCTCCGGGT GCTCGTCGTC GTGGTGGTTC CGCTTCCCGT TCCCTGCCGC 

701 TGCCGAAACG TCCGCGTCGT GGTGCTGCTC CGGAACCGGA ACGTACCCCG 

751 GTTGGTCAGG GTTCCTGGGC TCACCCGGGT CGTACCCGTG GTCCGTCCGA 

801 CCGTGGTTTC TGCGTTGTTT CCCCGGCTCG TCCGGCTGAA GAAGCTACCT 

851 CCCTGGAAGG TGCTCTGTCC GGTACCCGTC ACTCCCACCC GTCCGTTGGT 

901 CGTCAGCACC ACGCTGGTCC GCCGTCCACC TCCCGTCCGC CGCGTCCGTG 

951 GGACACCCCG TGCCCGCCGG TTTACGCTGA AACCAAACAC TTCCTGTACT 

1001 CCTCCGGTGA CAAAGAACAG CTGCGTCCGT CCTTCCTGCT GTCCTCCCTG 

1051 CGTCCGTCCC TGACCGGTGC TCGTCGTCTG GTTGAAACCA TCTTCCTGGG 

1101 TTCCCGTCCG TGGATGCCGG GTACCCCGCG TCGTCTGCCG CGTCTGCCGC 
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1151 AGCGTTACTG GCAGATGCGT CCGCTGTTCC TGGAACTGCT GGGTAACCAC 

1201 GCTCAGTGCC CGTACGGTGT TCTGCTGAAA ACCCACTGCC CGCTGCGTGC 

1251 TGCTGTTACC CCGGCTGCTG GTGTTTGCGC TCGTGAAAAA CCGCAGGGTT 

1301 CCGTTGCTGC TCCGGAAGAA GAAGACACCG ACCCGCGTCG TCTGGTTCAG 

5 1351 CTGCTGCGTC AGCACTCCTC CCCGTGGCAG GTTTACGGTT TCGTTCGTGC 

14 01 TTGCCTGCGT CGTCTGGTTC CGCCGGGTCT GTGGGGTTCC CGTCACAACG 

1451 AACGTCGTTT CCTGCGTAAC ACCAAAAAAT TCATCTCCCT GGGTAAACAC 

1501 GCTAAACTGT CCCTGCAGGA ACTGACCTGG AAAATGTCCG TTCGTGACTG 

1551 CGCTTGGCTG CGTCGTTCCC CGGGTGTTGG TTGCGTTCCG GCTGCTGAAC 

10 1601 ACCGTCTGCG TGAAGAAATC CTGGCTAAAT TCCTGCACTG GCTGATGTCC 

1651 GTTTACGTTG TTGAACTGCT GCGTTCCTTC TTCTACGTTA CCGAAACCAC 

1701 CTTCCAGAAA AACCGTCTGT TCTTCTACCG TAAATCCGTT TGGTCCAAAC 

1751 TGCAGTCCAT CGGTATCCGT CAGCACCTGA AACGTGTTCA GCTGCGTGAA 

1801 CTGTCCGAAG CTGAAGTTCG TCAGCACCGT GAAGCTCGTC CGGCTCTGCT 

15 1851 GACCTCCCGT CTGCGTTTCA TCCCGAAACC GGACGGTCTG CGTCCGATCG 

1901 TTAACATGGA CTACGTTGTT GGTGCTCGTA CCTTCCGTCG TGAAAAACGT 

1951 GCTGAACGTC TGACCTCCCG TGTTAAAGCT CTGTTCTCCG TTCTGAACTA 

2001 CGAACGTGCT CGTCGTCCGG GTCTGCTGGG TGCTTCCGTT CTGGGTCTGG 

2051 ACGACATCCA CCGTGCTTGG CGTACCTTCG TTCTGCGTGT TCGTGCTCAG 

20 2101 GACCCGCCGC CGGAACTGTA CTTCGTTAAA GTTGACGTTA CCGGTGCTTA 

2151 CGACACCATC CCGCAGGACC GTCTGACCGA AGTTATCGCT TCCATCATCA 

2201 AACCGCAGAA CACCTACTGC GTTCGTCGTT ACGCTGTTGT TCAGAAAGCT 

2251 GCTCACGGTC ACGTTCGTAA AGCTTTCAAA TCCCACGTTT CCACCCTGAC 

2301 CGACCTGCAG CCGTACATGC GTCAGTTCGT TGCTCACCTG CAGGAAACCT 

25 23 51 CCCCGCTGCG TGACGCTGTT GTTATCGAAC AGTCCTCCTC CCTGAACGAA 

2401 GCTTCCTCCG GTCTGTTCGA CGTTTTCCTG CGTTTCATGT GCCACCACGC 

2451 TGTTCGTATC CGTGGTAAAT CCTACGTTCA GTGCCAGGGT ATCCCGCAGG 

2 501 GTTCCATCCT GTCCACCCTG CTGTGCTCCC TGTGCTACGG TGACATGGAA 
2551 AACAAACTGT TCGCTGGTAT CCGTCGTGAC GGTCTGCTGC TGCGTCTGGT 

30 2601 TGACGACTTC CTGCTGGTTA CCCCGCACCT GACCCACGCT AAAACCTTCC 

2651 TGCGTACCCT GGTTCGTGGT GTTCCGGAAT ACGGTTGCGT TGTTAACCTG 

2701 CGTAAAACCG TTGTTAACTT CCCGGTTGAA GACGAAGCTC TGGGTGGTAC 

2751 CGCTTTCGTT CAGATGCCGG CTCACGGTCT GTTCCCGTGG TGCGGTCTGC 

2801 TGCTGGACAC CCGTACCCTG GAAGTTCAGT CCGACTACTC CTCCTACGCT 

35 2851 CGTACCTCCA TCCGTGCTTC CCTGACCTTC AACCGTGGTT TCAAAGCTGG 

2901 TCGTAACATG CGTCGTAAAC TGTTCGGTGT TCTGCGTCTG AAATGCCACT 

2951 CCCTGTTCCT GGACCTGCAG GTTAACTCCC TGCAGACCGT TTGCACCAAC 

3001 ATCTACAAAA TCCTGCTGCT GCAGGCTTAC CGTTTCCACG CTTGCGTTCT 

3 051 GCAGCTGCCG TTCCACCAGC AGGTTTGGAA AAACCCGACC TTCTTCCTGC 
40 3101 GTGTTATCTC CGACACCGCT TCCCTGTGCT ACTCCATCCT GAAAGCTAAA 

3151 AACGCTGGTA TGTCCCTGGG TGCTAAAGGT GCTGCTGGTC CGCTGCCGTC 

32 01 CGAAGCTGTT CAGTGGCTGT GCCACCAGGC TTTCCTGCTG AAACTGACCC 

3251 GTCACCGTGT TACCTACGTT CCGCTGCTGG GTTCCCTGCG TACCGCTCAG 

3301 ACCCAGCTGT CCCGTAAACT GCCGGGTACC ACCCTGACCG CTCTGGAAGC 

45 3351 TGCTGCTAAC CCGGCTCTGC CGTCCGACTT CAAAACCATC CTGGAC 

Table 9C 
Yeast (AU Genes) 

50 ATG CCA AGA GCT CCA AGA TGT AGA GCT GTT AGA TCT TTG TTG AGA TCT CAT TAT 
AGA GAA GTT TTG CCA TTG GCT ACT TTT GTT AGA AGA TTG GGT CCA CAA GGT TGG 
AGA TTG GTT CAA AGA GGT GAT CCA GCT GCT TTT AGA GCT TTG GTT GCT CAA TGT 
TTG GTT TGT GTT CCA TGG GAT GCT AGA CCA CCA CCA GCT GCT CCA TCT TTT AGA 
CAA GTT TCT TGT TTG AAA GAA TTG GTT GCT AGA GTT TTG CAA AGA TTG TGT GAA 

55 AGA GGT GCT AAA AAT GTT TTG GCT TTT GGT TTT GCT TTG TTG GAT GGT GCT AGA 
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GGT GGT CCA CCA GAA GCT TTT ACT 
GTT ACT GAT GCT TTG AGA GGT TCT 
GGT GAT GAT GTT TTG GTT CAT TTG 
GCT CCA TCT TGT GCT TAT CAA GTT 
5 GCT ACT CAA GCT AGA CCA CCA CCA 
TGT GAA AGA GCT TGG AAT CAT TCT 
CCA GCT CCA GGT GCT AGA AGA AGA 
CCA AAA AGA CCA AGA AGA GGT GCT 
CAA GGT TCT TGG GCT CAT CCA GGT 
10 TGT GTT GTT TCT CCA GCT AGA CCA 
TTG TCT GGT ACT AGA CAT TCT CAT 
CCA CCA TCT ACT TCT AGA CCA CCA 
TAT GCT GAA ACT AAA CAT TTT TTG 
CCA TCT TTT TTG TTG TCT TCT TTG 
15 GTT GAA ACT ATT TTT TTG GGT TCT 
TTG CCA AGA TTG CCA CAA AGA TAT 
TTG GGT AAT CAT GCT CAA TGT CCA 
TTG AGA GCT GCT GTT ACT CCA GCT 
GGT TCT GTT GCT GCT CCA GAA GAA 
20 TTG TTG AGA CAA CAT TCT TCT CCA 
TTG AGA AGA TTG GTT CCA CCA GGT 
TTT TTG AGA AAT ACT AAA AAA TTT 
TTG CAA GAA TTG ACT TGG AAA ATG 
TCT CCA GGT GTT GGT TGT GTT CCA 
25 TTG GCT AAA TTT TTG CAT TGG TTG 
TCT TTT TTT TAT GTT ACT GAA ACT 
AGA AAA TCT GTT TGG TCT AAA TTG 
AGA GTT CAA TTG AGA GAA TTG TCT 
AGA CCA GCT TTG TTG ACT TCT AGA 
30 AGA CCA ATT GTT AAT ATG GAT TAT 
AAA AGA GCT GAA AGA TTG ACT TCT 
TAT GAA AGA GCT AGA AGA CCA GGT 
GAT ATT CAT AGA GCT TGG AGA ACT 
CCA CCA GAA TTG TAT TTT GTT AAA 
35 CCA CAA GAT AGA TTG ACT GAA GTT 
TAT TGT GTT AGA AGA TAT GCT GTT 
AAA GCT TTT AAA TCT CAT GTT TCT 
CAA TTT GTT GCT CAT TTG CAA GAA 
GAA CAA TCT TCT TCT TTG AAT GAA 
40 AGA TTT ATG TGT CAT CAT GCT GTT 
CAA GGT ATT CCA CAA GGT TCT ATT 
GGT GAT ATG GAA AAT AAA TTG TTT 
AGA TTG GTT GAT GAT TTT TTG TTG 
TTT TTG AGA ACT TTG GTT AGA GGT 
45 AGA AAA ACT GTT GTT AAT TTT CCA 
TTT GTT CAA ATG CCA GCT CAT GGT 
ACT AGA ACT TTG GAA GTT CAA TCT 
AGA GCT TCT TTG ACT TTT AAT AGA 
AAA TTG TTT GGT GTT TTG AGA TTG 
50 GTT AAT TCT TTG CAA ACT GTT TGT 
GCT TAT AGA TTT CAT GCT TGT GTT 
AAA AAT CCA ACT TTT TTT TTG AGA 
TCT ATT TTG AAA GCT AAA AAT GCT 
GGT CCA TTG CCA TCT GAA GCT GTT 
55 AAA TTG ACT AGA CAT AGA GTT ACT 



ACT TCT GTT AGA TCT TAT TTG CCA AAT ACT 
GGT GCT TGG GGT TTG TTG TTG AGA AGA GTT 
TTG GCT AGA TGT GCT TTG TTT GTT TTG GTT 
TGT GGT CCA CCA TTG TAT CAA TTG GGT GCT 
CAT GCT TCT GGT CCA AGA AGA AGA TTG GGT 
GTT AGA GAA GCT GGT GTT CCA TTG GGT TTG 
GGT GGT TCT GCT TCT AGA TCT TTG CCA TTG 
GCT CCA GAA CCA GAA AGA ACT CCA GTT GGT 
AGA ACT AGA GGT CCA TCT GAT AGA GGT TTT 
GCT GAA GAA GCT ACT TCT TTG GAA GGT GCT 
CCA TCT GTT GGT AGA CAA CAT CAT GCT GGT 
AGA CCA TGG GAT ACT CCA TGT CCA CCA GTT 
TAT TCT TCT GGT GAT AAA GAA CAA TTG AGA 
AGA CCA TCT TTG ACT GGT GCT AGA AGA TTG 
AGA CCA TGG ATG CCA GGT ACT CCA AGA AGA 
TGG CAA ATG AGA CCA TTG TTT TTG GAA TTG 
TAT GGT GTT TTG TTG AAA ACT CAT TGT CCA 
GCT GGT GTT TGT GCT AGA GAA AAA CCA CAA 
GAA GAT ACT GAT CCA AGA AGA TTG GTT CAA 
TGG CAA GTT TAT GGT TTT GTT AGA GCT TGT 
TTG TGG GGT TCT AGA CAT AAT GAA AGA AGA 
ATT TCT TTG GGT AAA CAT GCT AAA TTG TCT 
TCT GTT AGA GAT TGT GCT TGG TTG AGA AGA 
GCT GCT GAA CAT AGA TTG AGA GAA GAA ATT 
ATG TCT GTT TAT GTT GTT GAA TTG TTG AGA 
ACT TTT CAA AAA AAT AGA TTG TTT TTT TAT 
CAA TCT ATT GGT ATT AGA CAA CAT TTG AAA 
GAA GCT GAA GTT AGA CAA CAT AGA GAA GCT 
TTG AGA TTT ATT CCA AAA CCA GAT GGT TTG 
GTT GTT GGT GCT AGA ACT TTT AGA AGA GAA 
AGA GTT AAA GCT TTG TTT TCT GTT TTG AAT 
TTG TTG GGT GCT TCT GTT TTG GGT TTG GAT 
TTT GTT TTG AGA GTT AGA GCT CAA GAT CCA 
GTT GAT GTT ACT GGT GCT TAT GAT ACT ATT 
ATT GCT TCT ATT ATT AAA CCA CAA AAT ACT 
GTT CAA AAA GCT GCT CAT GGT CAT GTT AGA 
ACT TTG ACT GAT TTG CAA CCA TAT ATG AGA 
ACT TCT CCA TTG AGA GAT GCT GTT GTT ATT 
GCT TCT TCT GGT TTG TTT GAT GTT TTT TTG 
AGA ATT AGA GGT AAA TCT TAT GTT CAA TGT 
TTG TCT ACT TTG TTG TGT TCT TTG TGT TAT 
GCT GGT ATT AGA AGA GAT GGT TTG TTG TTG 
GTT ACT CCA CAT TTG ACT CAT GCT AAA ACT 
GTT CCA GAA TAT GGT TGT GTT GTT AAT TTG 
GTT GAA GAT GAA GCT TTG GGT GGT ACT GCT 
TTG TTT CCA TGG TGT GGT TTG TTG TTG GAT 
GAT TAT TCT TCT TAT GCT AGA ACT TCT ATT 
GGT TTT AAA GCT GGT AGA AAT ATG AGA AGA 
AAA TGT CAT TCT TTG TTT TTG GAT TTG CAA 
ACT AAT ATT TAT AAA ATT TTG TTG TTG CAA 
TTG CAA TTG CCA TTT CAT CAA CAA GTT TGG 
GTT ATT TCT GAT ACT GCT TCT TTG TGT TAT 
GGT ATG TCT TTG GGT GCT AAA GGT GCT GCT 
CAA TGG TTG TGT CAT CAA GCT TTT TTG TTG 
TAT GTT CCA TTG TTG GGT TCT TTG AGA ACT 
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GCT CAA ACT CAA TTG TCT AGA AAA TTG CCA GGT ACT ACT TTG ACT GCT TTG GAA 
GCT GCT GCT AAT CCA GCT TTG CCA TCT GAT TTT AAA ACT ATT TTG GAT 

Table 9D 
Yeast (High Expressing Genes) 

ATG CCA AGA GCT CCA AGA TGT AGA GCT GTT AGA TCT TTG TTG AGA TCT CAC TAC 
AGA GAA GTT TTG CCA TTG GCT ACT TTC GTT AGA AGA TTG GGT CCA CAA GGT TGG 
AGA TTG GTT CAA AGA GGT GAC CCA GCT GCT TTC AGA GCT TTG GTT GCT CAA TGT 
TTG GTT TGT GTT CCA TGG GAC GCT AGA CCA CCA CCA GCT GCT CCA TCT TTC AGA 
CAA GTT TCT TGT TTG AAG GAA TTG GTT GCT AGA GTT TTG CAA AGA TTG TGT GAA 
AGA GGT GCT AAG AAC GTT TTG GCT TTC GGT TTC GCT TTG TTG GAC GGT GCT AGA 
GGT GGT CCA CCA GAA GCT TTC ACT ACT TCT GTT AGA TCT TAC TTG CCA AAC ACT 
GTT ACT GAC GCT TTG AGA GGT TCT GGT GCT TGG GGT TTG TTG TTG AGA AGA GTT 
GGT GAC GAC GTT TTG GTT CAC TTG TTG GCT AGA TGT GCT TTG TTC GTT TTG GTT 
GCT CCA TCT TGT GCT TAC CAA GTT TGT GGT CCA CCA TTG TAC CAA TTG GGT GCT 
GCT ACT CAA GCT AGA CCA CCA CCA CAC GCT TCT GGT CCA AGA AGA AGA TTG GGT 
TGT GAA AGA GCT TGG AAC CAC TCT GTT AGA GAA GCT GGT GTT CCA TTG GGT TTG 
CCA GCT CCA GGT GCT AGA AGA AGA GGT GGT TCT GCT TCT AGA TCT TTG CCA TTG 
CCA AAG AGA CCA AGA AGA GGT GCT GCT CCA GAA CCA GAA AGA ACT CCA GTT GGT 
CAA GGT TCT TGG GCT CAC CCA GGT AGA ACT AGA GGT CCA TCT GAC AGA GGT TTC 
TGT GTT GTT TCT CCA GCT AGA CCA GCT GAA GAA GCT ACT TCT TTG GAA GGT GCT 
TTG TCT GGT ACT AGA CAC TCT CAC CCA TCT GTT GGT AGA CAA CAC CAC GCT GGT 
CCA CCA TCT ACT TCT AGA CCA CCA AGA CCA TGG GAC ACT CCA TGT CCA CCA GTT 
TAC GCT GAA ACT AAG CAC TTC TTG TAC TCT TCT GGT GAC AAG GAA CAA TTG AGA 
CCA TCT TTC TTG TTG TCT TCT TTG AGA CCA TCT TTG ACT GGT GCT AGA AGA TTG 
GTT GAA ACT ATT TTC TTG GGT TCT AGA CCA TGG ATG CCA GGT ACT CCA AGA AGA 
TTG CCA AGA TTG CCA CAA AGA TAC TGG CAA ATG AGA CCA TTG TTC TTG GAA TTG 
TTG GGT AAC CAC GCT CAA TGT CCA TAC GGT GTT TTG TTG AAG ACT CAC TGT CCA 
TTG AGA GCT GCT GTT ACT CCA GCT GCT GGT GTT TGT GCT AGA GAA AAG CCA CAA 
GGT TCT GTT GCT GCT CCA GAA GAA GAA GAC ACT GAC CCA AGA AGA TTG GTT CAA 
TTG TTG AGA CAA CAC TCT TCT CCA TGG CAA GTT TAC GGT TTC GTT AGA GCT TGT 
TTG AGA AGA TTG GTT CCA CCA GGT TTG TGG GGT TCT AGA CAC AAC GAA AGA AGA 
TTC TTG AGA AAC ACT AAG AAG TTC ATT TCT TTG GGT AAG CAC GCT AAG TTG TCT 
TTG CAA GAA TTG ACT TGG AAG ATG TCT GTT AGA GAC TGT GCT TGG TTG AGA AGA 
TCT CCA GGT GTT GGT TGT GTT CCA GCT GCT GAA CAC AGA TTG AGA GAA GAA ATT 
TTG GCT AAG TTC TTG CAC TGG TTG ATG TCT GTT TAC GTT GTT GAA TTG TTG AGA 
TCT TTC TTC TAC GTT ACT GAA ACT ACT TTC CAA AAG AAC AGA TTG TTC TTC TAC 
AGA AAG TCT GTT TGG TCT AAG TTG CAA TCT ATT GGT ATT AGA CAA CAC TTG AAG 
AGA GTT CAA TTG AGA GAA TTG TCT GAA GCT GAA GTT AGA CAA CAC AGA GAA GCT 
AGA CCA GCT TTG TTG ACT TCT AGA TTG AGA TTC ATT CCA AAG CCA GAC GGT TTG 
AGA CCA ATT GTT AAC ATG GAC TAC GTT GTT GGT GCT AGA ACT TTC AGA AGA GAA 
AAG AGA GCT GAA AGA TTG ACT TCT AGA GTT AAG GCT TTG TTC TCT GTT TTG AAC 
TAC GAA AGA GCT AGA AGA CCA GGT TTG TTG GGT GCT TCT GTT TTG GGT TTG GAC 
GAC ATT CAC AGA GCT TGG AGA ACT TTC GTT TTG AGA GTT AGA GCT CAA GAC CCA 
CCA CCA GAA TTG TAC TTC GTT AAG GTT GAC GTT ACT GGT GCT TAC GAC ACT ATT 
CCA CAA GAC AGA TTG ACT GAA GTT ATT GCT TCT ATT ATT AAG CCA CAA AAC ACT 
TAC TGT GTT AGA AGA TAC GCT GTT GTT CAA AAG GCT GCT CAC GGT CAC GTT AGA 
AAG GCT TTC AAG TCT CAC GTT TCT ACT TTG ACT GAC TTG CAA CCA TAC ATG AGA 
CAA TTC GTT GCT CAC TTG CAA GAA ACT TCT CCA TTG AGA GAC GCT GTT GTT ATT 
GAA CAA TCT TCT TCT TTG AAC GAA GCT TCT TCT GGT TTG TTC GAC GTT TTC TTG 
AGA TTC ATG TGT CAC CAC GCT GTT AGA ATT AGA GGT AAG TCT TAC GTT CAA TGT 
CAA GGT ATT CCA CAA GGT TCT ATT TTG TCT ACT TTG TTG TGT TCT TTG TGT TAC 
GGT GAC ATG GAA AAC AAG TTG TTC GCT GGT ATT AGA AGA GAC GGT TTG TTG TTG 
AGA TTG GTT GAC GAC TTC TTG TTG GTT ACT CCA CAC TTG ACT CAC GCT AAG ACT 
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TTC 


TTG 


AGA 


ACT 


TTG 


GTT 


AGA 


GGT 


GTT 


CCA 


GAA 


TAC 


GGT 


TGT 


GTT 


GTT 


AAC 


TTG 


AGA 


AAG 


ACT 


GTT 


GTT 


AAC 


TTC 


CCA 


GTT 


GAA 


GAC 


GAA 


GCT 


TTG 


GGT 


GGT 


ACT 


GCT 


TTC 


GTT 


CAA 


ATG 


CCA 


GCT 


CAC 


GGT 


TTG 


TTC 


CCA 


TGG 


TGT 


GGT 


TTG 


TTG 


TTG 


GAC 


ACT 


AGA 


ACT 


TTG 


GAA 


GTT 


CAA 


TCT 


GAC 


TAC 


TCT 


TCT 


TAC 


GCT 


AGA 


ACT 


TCT 


ATT 


AGA 


GCT 


TCT 


TTG 


ACT 


TTC 


AAC 


AGA 


GGT 


TTC 


AAG 


GCT 


GGT 


AGA 


AAC 


ATG 


AGA 


AGA 


AAG 


TTG 


TTC 


GGT 


GTT 


TTG 


AGA 


TTG 


AAG 


TGT 


CAC 


TCT 


TTG 


TTC 


TTG 


GAC 


TTG 


CAA 


GTT 


AAC 


TCT 


TTG 


CAA 


ACT 


GTT 


TGT 


ACT 


AAC 


ATT 


TAC 


AAG 


ATT 


TTG 


TTG 


TTG 


CAA 


GCT 


TAC 


AGA 


TTC 


CAC 


GCT 


TGT 


GTT 


TTG 


CAA 


TTG 


CCA 


TTC 


CAC 


CAA 


CAA 


GTT 


TGG 


AAG 


AAC 


CCA 


ACT 


TTC 


TTC 


TTG 


AGA 


GTT 


ATT 


TCT 


GAC 


ACT 


GCT 


TCT 


TTG 


TGT 


TAC 


TCT 


ATT 


TTG 


AAG 


GCT 


AAG 


AAC 


GCT 


GGT 


ATG 


TCT 


TTG 


GGT 


GCT 


AAG 


GGT 


GCT 


GCT 


GGT 


CCA 


TTG 


CCA 


TCT 


GAA 


GCT 


GTT 


CAA 


TGG 


TTG 


TGT 


CAC 


CAA 


GCT 


TTC 


TTG 


TTG 


AAG 


TTG 


ACT 


AGA 


CAC 


AGA 


GTT 


ACT 


TAC 


GTT 


CCA 


TTG 


TTG 


GGT 


TCT 


TTG 


AGA 


ACT 


GCT 


CAA 


ACT 


CAA 


TTG 


TCT 


AGA 


AAG 


TTG 


CCA 


GGT 


ACT 


ACT 


TTG 


ACT 


GCT 


TTG 


GAA 


GCT 


GCT 


GCT 


AAC 


CCA 


GCT 


TTG 


CCA 


TCT 


GAC 


TTC 


AAG 


ACT 


ATT 


TTG 


GAC 
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Table 9E 

"Generic" hTRT Protein Encoding Sequence 



ZU 


ATG 


CCA 


CGT 


GCC 


CCA 


CGI 


1G1 


CGI 




ul 1 


new 


x V, x 


J. J- w 


TTCi 


V_- o X 


TCT 




TAC 


CGT 




GAA 


GTT 


TTG 


CCA 


TTG 


GCC 


ACC 


TTC 


Gl 1 




CGI 


1 1G 


PPT 
GG1 


CCA 
V— L.A 


CAA 
V_AA 


new 






1 xo 




GTT 


CAA 


CGT 


GGT 


GAT 


CCA 


GCC 


GCC 


i 1C 


CGI 


GCC 


1 1G 


Gl 1 


ccc 


CAA 
\_AA 


r rr"v 

1\jX 


1 1G 


\J J. i. 


T , ri r r 

lui 




GTT 


CCA 


TGG 


GAT 


GCC 


CGT 


CCA 


CCA 


CCA 


GCC 


GCC 


CCA 


1C 1 


11C 


CGI 


GAA 


Gl 1 


1 G 1 






TTG 


AAA 


GAA 


TTG 


GTT 


GCC 


CGT 


GTT 


TTG 


CAA 


CGI 


1 1G 


1G1 


GAA 


CGI 


GGl 


GCC 


2i 2i & 
AAA 


7i AC 


Id 


GTT 


TTG 


GCC 


TTC 


GGT 


TTC 


GCC 


Mill 1/1 

TTG 


TTG 


bAi 


GG1 


ccc 




U\3l 






CCA 




f3CC 




TTC 


ACC 


ACC 


TCT 


GTT 


CGT 


TCT 


TAC 


I 1G 


LLA 


AAC 


ACC 


Gl 1 


ALL 


CAT 
GA1 


GGG 


1 1G 








TCT 


GGT 


GCC 


TGG 


GGT 


TTG 


TTG 


TTG 


CGT 


CGI 


Gil 


GG1 


GA1 


GA1 


Gl 1 


1 1G 


Gl 1 


CTiC 
GAG 


X XV3 




TTG 


GCC 


CGT 


TGT 


GCC 


TTG 


TTC 


GTT 


TTG 


GTT 


GCC 


/-1/-17\ 

CCA 


1C1 


1G1 


GCC 


1AC 


CAA 


Gl 1 


1G1 




GGT 


CCA 


CCA 


TTG 


TAC 


CAA 


TTG 


GGT 


GCC 


GCC 


ACC 


CAA 


GCC 


CGI 


CCA 


CCA 


CCA 


CAC 




30 


TCT 


GGT 


CCA 


CGT 


CGT 


CGT 


TTG 


GGT 


TGT 


GAA 


CGT 


/~t /-i/-t 

GCC 


TGG 


AAC 


CAC 


1C1 


/^l Mill 

Gl 1 


CGI 


GAA 




GCC 


GGT 


GTT 


CCA 


TTG 


GGT 


TTG 


CCA 


GCC 


CCA 


GGT 


GCC 


CGT 


CGT 


CGT 


GGT 


GGT 


TCT 


GCC 




TCT 


CGT 


TCT 


TTG 


CCA 


TTG 


CCA 


AAA 


CGT 


CCA 


CGT 


CGT 


GGT 


GCC 


GCC 


CCA 


GAA 


CCA 


GAA 




CGT 


ACC 


CCA 


GTT 


GGT 


CAA 


GGT 


TCT 


TGG 


GCC 


CAC 


CCA 


GGT 


CGT 


ACC 


CGT 


GGT 


CCA 


TCT 




GAT 


CGT 


GGT 


TTC 


TGT 


GTT 


GTT 


TCT 


CCA 


GCC 


CGT 


CCA 


GCC 


GAA 


GAA 


GCC 


ACC 


TCT 


TTG 


35 


GAA 


GGT 


GCC 


TTG 


TCT 


GGT 


ACC 


CGT 


CAC 


TCT 


CAC 


CCA 


TCT 


GTT 


GGT 


CGT 


CAA 


CAC 


CAC 




GCC 


GGT 


CCA 


CCA 


TCT 


ACC 


TCT 


CGT 


CCA 


CCA 


CGT 


CCA 


TGG 


GAT 


ACC 


CCA 


TGT 


CCA 


CCA 




GTT 


TAC 


GCC 


GAA 


ACC 


AAA 


CAC 


TTC 


TTG 


TAC 


TCT 


TCT 


GGT 


GAT 


AAA 


GAA 


CAA 


TTG 


CGT 




CCA 


TCT 


TTC 


TTG 


TTG 


TCT 


TCT 


TTG 


CGT 


CCA 


TCT 


TTG 


ACC 


GGT 


GCC 


CGT 


CGT 


TTG 


GTT 




GAA 


ACC 


ATT 


TTC 


TTG 


GGT 


TCT 


CGT 


CCA 


TGG 


ATG 


CCA 


GGT 


ACC 


CCA 


CGT 


CGT 


TTG 


CCA 


40 


CGT 


TTG 


CCA 


CAA 


CGT 


TAC 


TGG 


CAA 


ATG 


CGT 


CCA 


TTG 


TTC 


TTG 


GAA 


TTG 


TTG 


GGT 


AAC 




CAC 


GCC 


CAA 


TGT 


CCA 


TAC 


GGT 


GTT 


TTG 


TTG 


AAA 


ACC 


CAC 


TGT 


CCA 


TTG 


CGT 


GCC 


GCC 




GTT 


ACC 


CCA 


GCC 


GCC 


GGT 


GTT 


TGT 


GCC 


CGT 


GAA 


AAA 


CCA 


CAA 


GGT 


TCT 


GTT 


GCC 


GCC 




CCA 


GAA 


GAA 


GAA 


GAT 


ACC 


GAT 


CCA 


CGT 


CGT 


TTG 


GTT 


CAA 


TTG 


TTG 


CGT 


CAA 


CAC 


TCT 




TCT 


CCA 


TGG 


CAA 


GTT 


TAC 


GGT 


TTC 


GTT 


CGT 


GCC 


TGT 


TTG 


CGT 


CGT 


TTG 


GTT 


CCA 


CCA 


45 


GGT 


TTG 


TGG 


GGT 


TCT 


CGT 


CAC 


AAC 


GAA 


CGT 


CGT 


TTC 


TTG 


CGT 


AAC 


ACC 


AAA 


AAA 


TTC 




ATT 


TCT 


TTG 


GGT 


AAA 


CAC 


GCC 


AAA 


TTG 


TCT 


TTG 


CAA 


GAA 


TTG 


ACC 


TGG 


AAA 


ATG 


TCT 




GTT 


CGT 


GAT 


TGT 


GCC 


TGG 


TTG 


CGT 


CGT 


TCT 


CCA 


GGT 


GTT 


GGT 


TGT 


GTT 


CCA 


GCC 


GCC 




GAA 


CAC 


CGT 


TTG 


CGT 


GAA 


GAA 


ATT 


TTG 


GCC 


AAA 


TTC 


TTG 


CAC 


TGG 


TTG 


ATG 


TCT 


GTT 




TAC 


GTT 


GTT 


GAA 


TTG 


TTG 


CGT 


TCT 


TTC 


TTC 


TAC 


GTT 


ACC 


GAA 


ACC 


ACC 


TTC 


CAA 


AAA 


50 


AAC 


CGT 


TTG 


TTC 


TTC 


TAC 


CGT 


AAA 


TCT 


GTT 


TGG 


TCT 


AAA 


TTG 


CAA 


TCT 


ATT 


GGT 


ATT 




CGT 


CAA 


CAC 


TTG 


AAA 


CGT 


GTT 


CAA 


TTG 


CGT 


GAA 


TTG 


TCT 


GAA 


GCC 


GAA 


GTT 


CGT 


CAA 




CAC 


CGT 


GAA 


GCC 


CGT 


CCA 


GCC 


TTG 


TTG 


ACC 


TCT 


CGT 


TTG 


CGT 


TTC 


ATT 


CCA 


AAA 


CCA 




GAT 


GGT 


TTG 


CGT 


CCA 


ATT 


GTT 


AAC 


ATG 


GAT 


TAC 


GTT 


GTT 


GGT 


GCC 


CGT 


ACC 


TTC 


CGT 




CGT 


GAA 


AAA 


CGT 


GCC 


GAA 


CGT 


TTG 


ACC 


TCT 


CGT 


GTT 


AAA 


GCC 


TTG 


TTC 


TCT 


GTT 


TTG 


55 


AAC 


TAC 


GAA 


CGT 


GCC 


CGT 


CGT 


CCA 


GGT 


TTG 


TTG 


GGT 


GCC 


TCT 


GTT 


TTG 


GGT 


TTG 


GAT 
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GAT 


ATT 


CAC 


CGT 


GCC 


TGG 


CGT 


ACC 


TTC 


GTT 


TTG 


CGT 


GTT 


CGT 


GCC 


CAA 


GAT 


CCA 


CCA 


CCA 


GAA 


TTG 


TAC 


TTC 


GTT 


AAA 


GTT 


GAT 


GTT 


ACC 


GGT 


GCC 


TAC 


GAT 


ACC 


ATT 


CCA 


CAA 


GAT 


CGT 


TTG 


ACC 


GAA 


GTT 


ATT 


GCC 


TCT 


ATT 


ATT 


AAA 


CCA 


CAA 


AAC 


ACC 


TAC 


TGT 


GTT 


CGT 


CGT 


TAC 


GCC 


GTT 


GTT 


CAA 


AAA 


GCC 


GCC 


CAC 


GGT 


CAC 


GTT 


CGT 


AAA 


GCC 


TTC 


AAA 


TCT 


CAC 


GTT 


TCT 


ACC 


TTG 


ACC 


GAT 


TTG 


CAA 


CCA 


TAC 


ATG 


CGT 


CAA 


TTC 


GTT 


GCC 


CAC 


TTG 


CAA 


GAA 


ACC 


TCT 


CCA 


TTG 


CGT 


GAT 


GCC 


GTT 


GTT 


ATT 


GAA 


CAA 


TCT 


TCT 


TCT 


TTG 


AAC 


GAA 


GCC 


TCT 


TCT 


GGT 


TTG 


TTC 


GAT 


GTT 


TTC 


TTG 


CGT 


TTC 


ATG 


TGT 


CAC 


CAC 


GCC 


GTT 


CGT 


ATT 


CGT 


GGT 


AAA 


TCT 


TAC 


GTT 


CAA 


TGT 


CAA 


GGT 


ATT 


CCA 


CAA 


GGT 


TCT 


ATT 


TTG 


TCT 


ACC 


TTG 


TTG 


TGT 


TCT 


TTG 


TGT 


TAC 


GGT 


GAT 


ATG 


GAA 


AAC 


AAA 


TTG 


TTC 


GCC 


GGT 


ATT 


CGT 


CGT 


GAT 


GGT 


TTG 


TTG 


TTG 


CGT 


TTG 


GTT 


GAT 


GAT 


TTC 


TTG 


TTG 


GTT 


ACC 


CCA 


CAC 


TTG 


ACC 


CAC 


GCC 


AAA 


ACC 


TTC 


TTG 


CGT 


ACC 


TTG 


GTT 


CGT 


GGT 


GTT 


CCA 


GAA 


TAC 


GGT 


TGT 


GTT 


GTT 


AAC 


TTG 


CGT 


AAA 


ACC 


GTT 


GTT 


AAC 


TTC 


CCA 


GTT 


GAA 


GAT 


GAA 


GCC 


TTG 


GGT 


GGT 


ACC 


GCC 


TTC 


GTT 


CAA 


ATG 


CCA 


GCC 


CAC 


GGT 


TTG 


TTC 


CCA 


TGG 


TGT 


GGT 


TTG 


TTG 


TTG 


GAT 


ACC 


CGT 


ACC 


TTG 


GAA 


GTT 


CAA 


TCT 


GAT 


TAC 


TCT 


TCT 


TAC 


GCC 


CGT 


ACC 


TCT 


ATT 


CGT 


GCC 


TCT 


TTG 


ACC 


TTC 


AAC 


CGT 


GGT 


TTC 


AAA 


GCC 


GGT 


CGT 


AAC 


ATG 


CGT 


CGT 


AAA 


TTG 


TTC 


GGT 


GTT 


TTG 


CGT 


TTG 


AAA 


TGT 


CAC 


TCT 


TTG 


TTC 


TTG 


GAT 


TTG 


CAA 


GTT 


AAC 


TCT 


TTG 


CAA 


ACC 


GTT 


TGT 


ACC 


AAC 


ATT 


TAC 


AAA 


ATT 


TTG 


TTG 


TTG 


CAA 


GCC 


TAC 


CGT 


TTC 


CAC 


GCC 


TGT 


GTT 


TTG 


CAA 


TTG 


CCA 


TTC 


CAC 


CAA 


CAA 


GTT 


TGG 


AAA 


AAC 


CCA 


ACC 


TTC 


TTC 


TTG 


CGT 


GTT 


ATT 


TCT 


GAT 


ACC 


GCC 


TCT 


TTG 


TGT 


TAC 


TCT 


ATT 


TTG 


AAA 


GCC 


AAA 


AAC 


GCC 


GGT 


ATG 


TCT 


TTG 


GGT 


GCC 


AAA 


GGT 


GCC 


GCC 


GGT 


CCA 


TTG 


CCA 


TCT 


GAA 


GCC 


GTT 


CAA 


TGG 


TTG 


TGT 


CAC 


CAA 


GCC 


TTC 


TTG 


TTG 


AAA 


TTG 


ACC 


CGT 


CAC 


CGT 


GTT 


ACC 


TAC 


GTT 


CCA 


TTG 


TTG 


GGT 


TCT 


TTG 


CGT 


ACC 


GCC 


CAA 


ACC 


CAA 


TTG 


TCT 


CGT 


AAA 


TTG 


CCA 


GGT 


ACC 


ACC 


TTG 


ACC 


GCC 


TTG 


GAA 


GCC 


GCC 


GCC 


AAC 


CCA 


GCC 


TTG 


CCA 


TCT 


GAT 


TTC 


AAA 


ACC 


ATT 


TTG 


GAT 



















Following determination of the desired nucleotide sequence for the hTRT 
protein-encoding polynucleotide, the polynucleotide can be made by any suitable method 
including de novo chemical synthesis, directed mutagenesis of a synthetic or naturally 
occurring TRT gene or cDNA, or a combination of these methods. In one exemplary 
embodiment, oligonucleotides (typically 50-100 bases in length) are synthesized with a 5 ? 
phosphate group and include approximately 10-base overhangs (relative to adjacent 
oligonucleotides in the assembled gene) to direct subsequent ligations. Following 
purification and desalting, each oligonucleotide is annealed to its complement (e.g., by 
combining pairs of oligonucleotides in equimolar amounts in a neutral pH buffer with 
50-200 mM NaCl and 0.5 mM MgCl 2 ). Annealing may be monitored by native PAGE. The 
resulting double-stranded oligonucleotides are ligated to their neighbors in pairs. After each 
ligation the products are gel-purified, then ligated to the appropriate (neighboring) double- 
stranded DNAs. In this manner, fragments of approximately 600-800 basepairs are built up. 
These intermediate fragments are then cloned into vectors and sequenced. The fragments are 
then combined into a single vector (resulting in a vector containing a polynucleotide with the 
desired hTRT protein-encoding sequence). This step is facilitated by using restriction sites 
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present in, or engineered into, the polynucleotide sequence. Alternatively, the fragments can 
be built up by ligation until the complete cDNA is assembled and the assembled sequence 
cloned into a vector. Numerous other alternative methods and approaches will be apparent to 
those of skill in the art. 

5 Table 10A shows an exemplary set of oligonucleotide that can be used to 

produce a polynucleotide, shown in Table 10B, that employs a codon distribution 
preferentially used by highly expressed genes in E. coli. The sequence in Table 5B contains 
silent changes to some codons to introduce useful restriction sites spaced every 300-800 base 
pairs, to facilitate subcloning and modification. Oligonucleotide pairs for the initial 

1 0 annealing steps are indicated by the labels "T" (top strand) and "B" (bottom strand). The full- 
length polynucleotide (Table 10B) encodes the hTRT protein (with the start codon at 
nucleotides 28-30) and contains Sac I and Xho I sites at the termini flanking the open reading 
frame, which are useful for cloning into a variety of vectors (e.g., pBluescript II KS, 
Stratagene Inc., San Diego CA). Once cloned into an appropriate vector, the hTRT sequence 

1 5 may be expressed, modified (e.g., by site directed or cassette mutagenesis), subcloned, or 
otherwise used or manipulated. In one embodiment, the polynucleotide is subcloned into a 
pET vector containing a T7 RNA polymerase promoter (Novagen Inc., Madison, WI) and 
introduced into an E. coli strain having an inducible T7 polymerase (Novagen Inc., Madison, 
WI). One advantage to the pET system is that the E. coli culture may be grown before the T7 

20 RNA polymerase gene is induced, resulting in very high levels of transcription and 

minimizing the effect of any potential detrimental effect of the expressed protein on the cells. 



Table 10A: Oligonucleotides 

IB CCAGCGGCAGAACTTCGCGATAGTGGGAACGCAGCAGGGAACGAACAGCACGGCAACGCG 

GAGCACGCGGCATATGGTCGACTCTAGAGCTCCCGCGTGC 
IT GCACGCGGGAGCTCTAGAGTCGACCATATGCCGCGTGCTCCGCGTTGCCGTGCTGTTCGTTC 

CCTGCTGCGTTCCCACTATCGCGAAGTT 
2B GGCACTGAGCAACCAGAGCACGGAAAGCAGCCGGGTCACCACGCTGAACCAGACGCCAAC 



TABLE 10 



25 



SYNTHESIS OF hTRT POL YNT TCT .EOTIDP HAVTNG ALTERNATIVE 



CODON DISTRIBUTION 
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TGACCCGGCTGCTTTCCGTGCTCTGGTT 
3B GAACACGAGCAACCAGTTCTTTCAGGCAGGAAACCTGACGGAAGGACGGAGCAGCCGGCG 

GCGGACGAGCGTCCCACGGAACGCAAACCA 
3T GCTCAGTGCCTGGTTTGCGTTCCGTGGGACGCTCGTCCGCCGCCGGCTGCTCCGTCCTTCCGT 
5 CAGGTTTCCTGCCTGAAAGAACTGGTT 

4B ATGCTTCCGGCGGACCACCACGAGCACCGTCCAGCAGAGCGAAACCGAAAGCCAGAACGTT 

TTTAGCACCACGTTCGCACAGACGCTGCA 
4T GCTCGTGTTCTGCAGCGTCTGTGCGAACGTGGTGCTAAAAACGTTCTGGCTTTCGGTTTCGC 

TCTGCTGGACGGTGCTCGTGGTGGTCCG 
1 0 5B CAACACGACGCAGCAGCAGACCCCAAGCACCGGAACCACGCAGAGCGTCGGTAACGGTGTT 

CGGCAGGTAGGAACGAACGGAGGTGGTGA 
5T CCGGAAGCATTCACCACCTCCGTTCGTTCCTACCTGCCGAACACCGTTACCGACGCTCTGCG 

TGGTTCCGGTGCTTGGGGTCTGCTGCTG 
6B GCGGCGGACCACAAACCTGGTAAGCGCAGGACGGAGCAACCAGAACGAACAGAGCGCAAC 

15 GAGCCAGCAGGTGAACCAGAACGTCGTCAC 

6T CGTCGTGTTGGTGACGACGTTCTGGTTCACCTGCTGGCTCGTTGCGCTCTGTTCGTTCTGGTT 

GCTCCGTCCTGCGCTTACCAGGTTTGT 
7B GGTTCCAAGCACGTTCGCAACCCAGACGACGACGCGGACCGGAAGCGTGCGGCGGCGGAC 

GAGCCTGGGTAGCAGCACCCAGCTGGTACA 
20 7T GGTCCGCCGCTGTACCAGCTGGGTGCTGCTACCCAGGCTCGTCCGCCGCCGCACGCTTCCGG 

TCCGCGTCGTCGTCTGGGTTGCGAACGT 
8B GCAGCGGCAGGGAACGGGAAGCGGAACCACCACGACGACGAGCACCCGGAGCCGGCAGAC 

CCAGCGGAACACCAGCTTCACGAACGGAGT 
8T GCTTGGAACCACTCCGTTCGTGAAGCTGGTGTTCCGCTGGGTCTGCCGGCTCCGGGTGCTCG 

25 TCGTCGTGGTGGTTCCGCTTCCCGTTCC 

9B GACCACGGGTACGACCCGGGTGAGCCCAGGAACCCTGACCAACCGGGGTACGTTCCGGTTC 

CGGAGCAGCACCACGACGCGGACGTTTCG 
9T CTGCCGCTGCCGAAACGTCCGCGTCGTGGTGCTGCTCCGGAACCGGAACGTACCCCGGTTGG 

TCAGGGTTCCTGGGCTCACCCGGGTCGT 
30 10B AGTGACGGGTGCCGGACAGAGCACCTTCCAGGGAGGTAGCTTCTTCAGCCGGACGAGCCGG 

GGAAACAACGCAGAAACCACGGTCGGACG 
1 OT ACCCGTGGTCCGTCCGACCGTGGTTTCTGCGTTGTTTCCCCGGCTCGTCCGGCTGAAGAAGC 

TACCTCCCTGGAAGGTGCTCTGTCCGGC 
11B AAACCGGCGGGCACGGGGTGTCCCACGGACGCGGCGGACGGGAGGTGGACGGCGGACCAG 

35 CGTGGTGCTGACGACCAACGGACGGGTGGG 

11T ACCCGTCACTCCCACCCGTCCGTTGGTCGTCAGCACCACGCTGGTCCGCCGTCCACCTCCCG 

TCCGCCGCGTCCGTGGGACACCCCGTGC 
12B TCAGGGACGGACGCAGGGAGGACAGCAGGAAGGACGGACGCAGCTGTTCTTTGTCACCGG 
AGGAGTACAGGAAGTGTTTGGTTTCAGCGT 
40 12T CCGCCGGTTTACGCTGAAACCAAACACTTCCTGTACTCCTCCGGTGACAAAGAACAGCTGCG 

TCCGTCCTTCCTGCTGTCCTCCCTGCGT 
13B GCTGCGGCAGACGCGGCAGACGACGCGGGGTGCCCGGCATCCACGGACGGGAACCCAGGA 

AGATAGTTTCAACCAGACGACGAGCACCGG 
13T CCGTCCCTGACCGGTGCTCGTCGTCTGGTTGAAACTATCTTCCTGGGTTCCCGTCCGTGGATG 

45 CCGGGCACCCCGCGTCGTCTGCCGCGT 

14B GCGGGCAGTGGGTTTTCAGCAGAACACCATACGGGCACTGAGCGTGGTTGCCCAGCAGTTC 

CAGGAACAGCGGACGCATCTGCCAGTAAC 
14T CTGCCGCAGCGTTACTGGCAGATGCGTCCGCTGTTCCTGGAACTGCTGGGCAACCACGCTCA 

GTGCCCGTATGGTGTTCTGCTGAAAACC 

50 15B GGTCGGTATCTTCTTCTTCCGGAGCAGCAACGGAACCCTGCGGTTTTTCACGAGCGCAAACA 

CCAGCAGCCGGGGTAACAGCAGCACGCA 
15T CACTGCCCGCTGCGTGCTGCTGTTACCCCGGCTGCTGGTGTTTGCGCTCGTGAAAAACCGCA 

GGGTTCCGTTGCTGCTCCGGAAGAAGAA 
16B GCGGAACCAGACGACGCAGGCATGCACGAACGAAACCGTAAACCTGCCACGGGGAGGAGT 

55 GCTGACGCAGCAGCTGAACCAGACGACGCG 
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16T GATACCGACCCGCGTCGTCTGGTTCAGCTGCTGCGTCAGCACTCCTCCCCGTGGCAGGTTTA 

CGGTTTCGTTCGTGCATGCCTGCGTCGT 
1 7B GGG ACAGTTTAGCGTGTTTACCCAGGGAGATG AATTTTTTGGTGTTACGCAGGAAACG ACGT 

TCGTTGTGACGGGAACCCCACAGACCCG 
1 7T CTGGTTCCGCCGGGTCTGTGGGGTTCCCGTCACAACGAACGTCGTTTCCTGCGTAAC ACCAA 

AAAATTCATCTCCCTGGGTAAACACGCT 
1 8B GGTGTTCAGCAGCCGGAACGCAACCAACACCCGGAGAACGACGCAGCCAAGCGCAGTCAC 

GAACGGACATTTTCCAGGTCAGTTCCTGCA 
1 8T AAACTGTCCCTGCAGGAACTGACCTGGAAAATGTCCGTTCGTGACTGCGCTTGGCTGCGTCG 

TTCTCCGGGTGTTGGTTGCGTTCCGGCT 
19B CGGTAACGTAGAAGAAGGAACGCAGCAGTTCAACAACGTATACGGACATCAGCCAGTGCA 

GGAATTTAGCCAGGATTTCTTCACGCAGAC 
19T GCTGAACACCGTCTGCGTGAAGAAATCCTGGCTAAATTCCTGCACTGGCTGATGTCCGTATA 

CGTTGTTGAACTGCTGCGTTCCTTCTTC 
20B GTTTCAGGTGCTGACGGATACCGATGGACTGCAGTTTGGACCAAACGGATTTACGGTAGAA 

GAACAGACGGTTTTTCTGGAAGGTGGTTT 
20T TACGTTACCGAAACCACCTTCCAGAAAAACCGTCTGTTCTTCTACCGTAAATCCGTTTGGTC 

CAAACTGCAGTCCATCGGTATCCGTCAG 
2 IB GATGAAACGCAGACGGGAGGTCAGCAGAGCCGGACGAGCTTCACGGTGCTGACGAACTTCA 

GCTTCGGACAGTTCACGCAGCTGAACAC 
21T CACCTGAAACGTGTTCAGCTGCGTGAACTGTCCGAAGCTGAAGTTCGTCAGCACCGTGAAG 

CTCGTCCGGCTCTGCTGACCTCCCGTCTG 
22B TCAGACGCTCAGCACGTTTTTCACGACGGAAGGTACGAGCACCAACAACGTAGTCCATGTTT 

ACGATCGGACGCAGACCGTCCGGTTTCG 
22T CGTTTCATCCCGAAACCGGACGGTCTGCGTCCGATCGTAAACATGGACTACGTTGTTGGTGC 

TCGTACCTTCCGTCGTGAAAAACGTGCT 
23B CGTCCAGACCCAGAACGGAAGCACCCAGCAGACCCGGACGACGAGCACGTTCGTAGTTCAG 

AACGGAGAACAGAGCTTTAACACGGGAGG 
23T GAGCGTCTGACCTCCCGTGTTAAAGCTCTGTTCTCCGTTCTGAACTACGAACGTGCTCGTCG 

TCCGGGTCTGCTGGGTGCTTCCGTTCTG 
24B CGGTAACGTCAACTTTAACGAAGTACAGTTCCGGCGGCGGGTCCTGAGCACGAACACGCAG 

AACGAAGGTACGCCAAGCACGGTGGATGT 
24T GGTCTGGACGACATCCACCGTGCTTGGCGTACCTTCGTTCTGCGTGTTCGTGCTCAGGACCC 

GCCGCCGGAACTGTACTTCGTTAAAGTT 
25B CGTAACGACGAACGCAGTAGGTGTTCTGCGGTTTGATGATGGAAGCGATAACTTCGGTCAG 

ACGGTCCTGCGGGATGGTGTCGTACGCGC 
25T GACGTTACCGGCGCGTACGACACCATCCCGCAGGACCGTCTGACCGAAGTTATCGCTTCCAT 

CATCAAACCGCAGAACACCTACTGCGTT 
26B GACGCATGTACGGCTGCAGGTCGGTCAGGGTGGAAACGTGGGATTTGAATGCTTTACGAAC 

GTGACCGTGAGCAGCTTTCTGAACAACAG 
26T CGTCGTTACGCTGTTGTTCAGAAAGCTGCTCACGGTCACGTTCGTAAAGCATTCAAATCCCA 

CGTTTCCACCCTGACCGACCTGCAGCCG 
27B GACCGGAGGAAGCTTCGTTCAGGGAGGAGGACTGTTCGATAACAACAGCGTCACGCAGCGG 

GGAGGTTTCCTGCAGGTGAGCAACGAACT 
27T TACATGCGTCAGTTCGTTGCTCACCTGCAGGAAACCTCCCCGCTGCGTGACGCTGTTGTTAT 

CGAACAGTCCTCCTCCCTGAACGAAGCT 
28B AACCCTGCGGGATACCCTGGCACTGAACGTAGGATTTACCACGGATACGAACAGCGTGGTG 

GCACATGAAACGCAGGAAAACGTCGAACA 
28T TCCTCCGGTCTGTTCGACGTTTTCCTGCGTTTCATGTGCCACCACGCTGTTCGTATCCGTGGT 

AAATCCTACGTTCAGTGCCAGGGTATC 
29B GCAGCAGCAGACCGTCACGACGGATACCAGCGAACAGTTTGTTTTCCATGTCACCGTAGCA 

CAGGGAGCACAGCAGGGTGGACAGGATGG 
29T CCGCAGGGTTCCATCCTGTCCACCCTGCTGTGCTCCCTGTGCTACGGTGACATGGAAAACAA 

ACTGTTCGCTGGTATCCGTCGTGACGGT 
30B CGTATTCCGGAACACCACGAACCAGGGTACGCAGGAAGGTTTTAGCGTGGGTCAGGTGCGG 
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AGTAACCAGCAGGAAGTCGTCAACCAGAC 
30T CTGCTGCTGCGTCTGGTTGACGACTTCCTGCTGGTTACTCCGCACCTGACCCACGCTAAAAC 

CTTCCTGCGTACCCTGGTTCGTGGTGTT 
3 IB GAGCCGGCATCTGAACGAAAGCGGTGCCACCCAGAGCTTCGTCTTCAACCGGGAAGTTAAC 

5 AACGGTTTTACGCAGGTTTACAACGCAAC 

3 IT CCGGAATACGGTTGCGTTGTAAACCTGCGTAAAACCGTTGTTAACTTCCCGGTTGAAGACGA 

AGCTCTGGGTGGCACCGCTTTCGTTCAG 
32B GGATGGAGGTACGAGCGTAGGAGGAGTAGTCGGACTGAACTTCCAGGGTACGGGTGTCCAG 

CAGCAGACCGCACCACGGGAACAGACCGT 
1 0 32T ATGCCGGCTCACGGTCTGTTCCCGTGGTGCGGTCTGCTGCTGGACACCCGTACCCTGGAAGT 

TCAGTCCGACTACTCCTCCTACGCTCGT 
33B GGGAGTGGCATTTCAGACGCAGAACACCGAACAGTTTACGACGCATGTTACGACCAGCTTT 

GAAACCACGGTTGAAGGTCAGGGAAGCAC 
33T ACCTCCATCCGTGCTTCCCTGACCTTCAACCGTGGTTTCAAAGCTGGTCGTAACATGCGTCGT 

15 AAACTGTTCGGTGTTCTGCGTCTGAAA 

34B ACGCGTGGAAACGGTAAGCCTGCAGCAGCAGGATTTTGTAGATGTTGGTGCAAACGGTCTG 

CAGGGAGTTTACCTGCAGGTCCAGGAACA 
34T TGCCACTCCCTGTTCCTGGACCTGCAGGTAAACTCCCTGCAGACCGTTTGCACCAACATCTA 

CAAAATCCTGCTGCTGCAGGCTTACCGT 
20 35B AGTAGCACAGGGAAGCGGTGTCGGAGATAACACGCAGGAAGAAGGTCGGGTTTTTCCAAA 

CCTGCTGGTGGAACGGCAGCTGCAGAACGC 
3 5T TTCCACGCGTGCGTTCTGCAGCTGCCGTTCCACCAGCAGGTTTGGAAAAACCCGACCTTCTT 

CCTGCGTGTTATCTCCGACACCGCTTCC 
36B GGCACAGCCACTGAACAGCTTCGGACGGCAGCGGACCAGCAGCACCTTTAGCACCCAGGGA 

25 CATACCAGCGTTTTTAGCTTTCAGGATGG 

36T CTGTGCTACTCCATCCTGAAAGCTAAAAACGCTGGTATGTCCCTGGGTGCTAAAGGTGCTGC 

TGGTCCGCTGCCGTCCGAAGCTGTTCAG 
37B ACAGCTGGGTCTGAGCGGTACGCAGGGAACCCAGCAGCGGAACGTAGGTAACACGGTGAC 

GGGTCAGTTTCAGCAGGAAAGCCTGGT 
30 37T TGGCTGTGCCACCAGGCTTTCCTGCTGAAACTGACCCGTCACCGTGTTACCTACGTTCCGCT 

GCTGGGTTCCCTGCGTACCGCTCAG 
38B ACGGCAGAGCCGGGTTAGCAGCAGCTTCCAGAGCGGTCAGGGTGGTACCCGGCAGTTTACG 

GG 

38T ACCCAGCTGTCCCGTAAACTGCCGGGTACCACCCTGACCGCTCTGGAAGCTGCTGCTAACCC 
35 GG 

39B GCGTGCCTCGAGGAATTCGGATCCATTAGTCCAGGATGGTTTTGAAGTCG 

39T CTCTGCCGTCCGACTTCAAAACCATCCTGGACTAATGGATCCGAATTCCTCGAGGCACGC 



40 

Table 1QB 

GCACGCGGGAGCTCTAGAGTCGACCATATGCCGCGTGCTCCGCGTTGCCGTGCTGTTCGTT 
CCCTGCTGCGTTCCCACTATCGCGAAGTTCTGCCGCTGGCTACCTTCGTTCGTCGTCTGGG 
CCCGCAGGGTTGGCGTCTGGTTCAGCGTGGTGACCCGGCTGCTTTCCGTGCTCTGGTTGCT 

45 CAGTGCCTGGTTTGCGTTCCGTGGGACGCTCGTCCGCCGCCGGCTGCTCCGTCCTTCCGTC 
AGGTTTCCTGCCTGAAAGAACTGGTTGCTCGTGTTCTGCAGCGTCTGTGCGAACGTGGTGC 
TAAAAACGTTCTGGCTTTCGGTTTCGCTCTGCTGGACGGTGCTCGTGGTGGTCCGCCGGAA 
GCATTCACCACCTCCGTTCGTTCCTACCTGCCGAACACCGTTACCGACGCTCTGCGTGGTT 
CCGGTGCTTGGGGTCTGCTGCTGCGTCGTGTTGGTGACGACGTTCTGGTTCACCTGCTGGC 

50 TCGTTGCGCTCTGTTCGTTCTGGTTGCTCCGTCCTGCGCTTACCAGGTTTGTGGTCCGCCG 
CTGTACCAGCTGGGTGCTGCTACCCAGGCTCGTCCGCCGCCGCACGCTTCCGGTCCGCGTC 
GTCGTCTGGGTTGCGAACGTGCTTGGAACCACTCCGTTCGTGAAGCTGGTGTTCCGCTGGG 
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TCTGCCGGCTCCGGGTGCTCGTCGTCGTGGTGGTTCCGCTTCCCGTTCCCTGCCGCTGCCG 
AAACGTCCGCGTCGTGGTGCTGCTCCGGAACCGGAACGTACCCCGGTTGGTCAGGGTTCCT 
GGGCTCACCCGGGTCGTACCCGTGGTCCGTCCGACCGTGGTTTCTGCGTTGTTTCCCCGGC 
TCGTCCGGCTGAAGAAGCTACCTCCCTGGAAGGTGCTCTGTCCGGCACCCGTCACTCCCAC 
5 CCGTCCGTTGGTCGTCAGCACCACGCTGGTCCGCCGTCCACCTCCCGTCCGCCGCGTCCGT 
GGGACACCCCGTGCCCGCCGGTTTACGCTGAAACCAAACACTTCCTGTACTCCTCCGGTGA 
CAAAGAACAGCTGCGTCCGTCCTTCCTGCTGTCCTCCCTGCGTCCGTCCCTGACCGGTGCT 
CGTCGTCTGGTTGAAACTATCTTCCTGGGTTCCCGTCCGTGGATGCCGGGCACCCCGCGTC 
GTCTGCCGCGTCTGCCGCAGCGTTACTGGCAGATGCGTCCGCTGTTCCTGGAACTGCTGGG 

10 CAACCACGCTCAGTGCCCGTATGGTGTTCTGCTGAAAACCCACTGCCCGCTGCGTGCTGCT 
GTTACCCCGGCTGCTGGTGTTTGCGCTCGTGAAAAACCGCAGGGTTCCGTTGCTGCTCCGG 
AAGAAGAAGATACCGACCCGCGTCGTCTGGTTCAGCTGCTGCGTCAGCACTCCTCCCCGTG 
GCAGGTTTACGGTTTCGTTCGTGCATGCCTGCGTCGTCTGGTTCCGCCGGGTCTGTGGGGT 
TCCCGTCACAACGAACGTCGTTTCCTGCGTAACACCAAAAAATTCATCTCCCTGGGTAAAC 

15 ACGCTAAACTGTCCCTGCAGGAACTGACCTGGAAAATGTCCGTTCGTGACTGCGCTTGGCT 
GCGTCGTTCTCCGGGTGTTGGTTGCGTTCCGGCTGCTGAACACCGTCTGCGTGAAGAAATC 
CTGGCTAAATTCCTGCACTGGCTGATGTCCGTATACGTTGTTGAACTGCTGCGTTCCTTCT 
TCTACGTTACCGAAACCACCTTCCAGAAAAACCGTCTGTTCTTCTACCGTAAATCCGTTTG 
GTCCAAACTGCAGTCCATCGGTATCCGTCAGCACCTGAAACGTGTTCAGCTGCGTGAACTG 

20 TCCGAAGCTGAAGTTCGTCAGCACCGTGAAGCTCGTCCGGCTCTGCTGACCTCCCGTCTGC 
GTTTCATCCCGAAACCGGACGGTCTGCGTCCGATCGTAAACATGGACTACGTTGTTGGTGC 
TCGTACCTTCCGTCGTGAAAAACGTGCTGAGCGTCTGACCTCCCGTGTTAAAGCTCTGTTC 
TCCGTTCTGAACTACGAACGTGCTCGTCGTCCGGGTCTGCTGGGTGCTTCCGTTCTGGGTC 
TGGACGACATCCACCGTGCTTGGCGTACCTTCGTTCTGCGTGTTCGTGCTCAGGACCCGCC 

25 GCCGGAACTGTACTTCGTTAAAGTTGACGTTACCGGCGCGTACGACACCATCCCGCAGGAC 
CGTCTGACCGAAGTTATCGCTTCCATCATCAAACCGCAGAACACCTACTGCGTTCGTCGTT 
ACGCTGTTGTTCAGAAAGCTGCTCACGGTCACGTTCGTAAAGCATTCAAATCCCACGTTTC 
CACCCTGACCGACCTGCAGCCGTACATGCGTCAGTTCGTTGCTCACCTGCAGGAAACCTCC 
CCGCTGCGTGACGCTGTTGTTATCGAACAGTCCTCCTCCCTGAACGAAGCTTCCTCCGGTC 

30 TGTTCGACGTTTTCCTGCGTTTCATGTGCCACCACGCTGTTCGTATCCGTGGTAAATCCTA 
CGTTCAGTGCCAGGGTATCCCGCAGGGTTCCATCCTGTCCACCCTGCTGTGCTCCCTGTGC 
TACGGTGACATGGAAAACAAACTGTTCGCTGGTATCCGTCGTGACGGTCTGCTGCTGCGTC 
TGGTTGACGACTTCCTGCTGGTTACTCCGCACCTGACCCACGCTAAAACCTTCCTGCGTAC 
CCTGGTTCGTGGTGTTCCGGAATACGGTTGCGTTGTAAACCTGCGTAAAACCGTTGTTAAC 

35 TTCCCGGTTGAAGACGAAGCTCTGGGTGGCACCGCTTTCGTTCAGATGCCGGCTCACGGTC 
TGTTCCCGTGGTGCGGTCTGCTGCTGGACACCCGTACCCTGGAAGTTCAGTCCGACTACTC 
CTCCTACGCTCGTACCTCCATCCGTGCTTCCCTGACCTTCAACCGTGGTTTCAAAGCTGGT 
CGTAACATGCGTCGTAAACTGTTCGGTGTTCTGCGTCTGAAATGCCACTCCCTGTTCCTGG 
ACCTGCAGGTAAACTCCCTGCAGACCGTTTGCACCAACATCTACAAAATCCTGCTGCTGCA 

40 GGCTTACCGTTTCCACGCGTGCGTTCTGCAGCTGCCGTTCCACCAGCAGGTTTGGAAAAAC 
CCGACCTTCTTCCTGCGTGTTATCTCCGACACCGCTTCCCTGTGCTACTCCATCCTGAAAG 
CTAAAAACGCTGGTATGTCCCTGGGTGCTAAAGGTGCTGCTGGTCCGCTGCCGTCCGAAGC 
TGTTCAGTGGCTGTGCCACCAGGCTTTCCTGCTGAAACTGACCCGTCACCGTGTTACCTAC 
GTTCCGCTGCTGGGTTCCCTGCGTACCGCTCAGACCCAGCTGTCCCGTAAACTGCCGGGTA 

45 CCACCCTGACCGCTCTGGAAGCTGCTGCTAACCCGGCTCTGCCGTCCGACTTCAAAACCAT 
CCTGGACTAATGGATCCGAATTCCTCGAGGCACGC 
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The present invention also provides transgenic animals (i.e., mammals 
transgenic for a human or other TRT gene sequence) expressing an hTRT or other TRT 
polynucleotide or polypeptide. In one embodiment, hTRT is secreted into the milk of a 
transgenic mammal such as a transgenic bovine, goat, or rabbit. Methods for production of 
such animals are found, e.g., in Heyneker et al., PCT WO 91/08216. 

The hTRT proteins and complexes of the invention, including those made 
using the expression systems disclosed herein supra, may be purified using a variety of 
general methods known in the art in accordance with the specific methods provided by the 
present invention (e.g., infra). One of skill in the art will recognize that after chemical 
synthesis, biological expression, or purification, the hTRT protein may possess a 
conformation different than a native conformation of naturally occurring telomerase. In some 
instances, it may be helpful or even necessary to denature (e.g., including reduction of 
disulfide or other linkages) the polypeptide and then to cause the polypeptide to re-fold into 
the preferred conformation. Productive refolding may also require the presence of hTR (or 
hTR fragments). Methods of reducing and denaturing proteins and inducing re-folding are 
well known to those of skill in the art (see, e.g., Debinski et al., 1993, J. Biol. Chem., 
268:14065; Kreitman and Pastan, 1993, Bioconjug. Chem., 4:581; and Buchner et al., 1992, 
Anal. Biochem., 205:263; and McCaman et al., 1985, J. Biotech. 2:177). See also PCT 
Publication WO 96/40868, supra. 

D) COMPLEXES OF HUMAN TRT AND HUMAN TELOMERASE RNA, 
TELOMERASE- ASSOCIATED PROTEINS, AND OTHER BIOMOLECULES 
PRODUCED BY COEXPRESSION AND OTHER MEANS 

hTRT polypeptides of the invention can associate in vivo and in vitro with 
other biomolecules, including RNAs (e.g., hTR), proteins (e.g., telomerase-associated 
proteins), DNA (e.g., telomeric DNA, [T 2 AG 3 ] N ), and nucleotides, such as 
(deoxy)ribonucleotide triphosphates. These associations can be exploited to assay hTRT 
presence or function, to identify or purify hTRT or telomerase-associated molecules, and to 
analyze hTRT or telomerase structure or function in accordance with the methods of the 
present invention. 
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In one embodiment, the present invention provides hTRT complexed with 
(e.g., associated with or bound to) a nucleic acid, usually an RNA, for example to produce a 
telomerase holoenzyme. In one embodiment, the bound RNA is capable of acting as a 
template for telomerase-mediated DNA synthesis. Examples of RNAs that may be 
5 complexed with the hTRT polypeptide include a naturally occurring host cell telomerase 

RNA, a human telomerase RNA (e.g., hTR; U.S. Patent No. 5,583,016), an hTR subsequence 
or domain, a synthetic RNA, or other RNAs. The RNA-hTRT protein complex (an RNP) 
typically exhibits one or more telomerase activities, such as telomerase catalytic activities. 
These hTRT-hTR RNPs (or other hTRT-RNA complexes) can be produced by a variety of 

10 methods, as described infra for illustrative purposes, including in vitro reconstitution, by co- 
expression of hTRT and hTR (or other RNA) in vitro (i.e., in a cell free system), in vivo 
reconstitution, or ex vivo reconstitution. 

Thus, the present invention provides, in one embodiment, an hTRT-hTR 
complex (or other hTRT-RNA complex) formed in vitro by mixing separately purified 

15 components ("/« vitro reconstitution;" see, e.g., U.S. Patent No. 5,583,016 for a description of 
reconstitution; also see Autexier et al., EMBO J. 15:5928). In one embodiment the hTRT 
protein is produced by recombinant expression in human or non-human cells, e.g., as 
described supra, and subsequently purified using protein purification methods (e.g., 
chromatography, affinity purification). In a particular embodiment, the recombinant hTRT 

20 protein is purified to homogeneity. The purified hTRT protein is combined with separately 
purified hTR, which may be produced using an in vitro transcription system, by chemical 
synthesis, or by other methods and purified using standard RNA purification techniques (see 
Melton etal., 1984, Nucl Acids Res. 12:7035; StudieretaL, 1986, J. Mol Biol 189:113). 

In an alternative embodiment, the invention provides telomerase RNPs 

25 produced by coexpression of the hTRT polypeptide and an RNA (e.g., hTR) in vitro in a cell- 
free transcription-translation system (e.g. wheat germ or rabbit reticulocyte lysate). As 
shown in Example 7, in vitro co-expression of a recombinant hTRT polypeptide and hTR 
results in production of telomerase catalytic activity (as measured by a TRAP assay). 

Further provided by the present invention are telomerase RNPs produced by 

30 expression of the hTRT polypeptide in a cell, e.g., a mammalian cell, in which hTR is 

naturally expressed or in which hTR (or another RNA capable of forming a complex with the 
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hTRT protein) is introduced or expressed by recombinant means. Thus, in one embodiment, 
hTRT is expressed in a telomerase negative human cell in which hTR is present (e.g., BJ or 
IMP90 cells), allowing the two molecules to assemble into an RNP. In another embodiment, 
hTRT is expressed in a human or non-human cell in which hTR is recombinantly expressed. 

5 Methods for expression of hTR in a cell are found in U.S. Patent 5,583,016. Further, a clone 
containing a cDNA encoding the RNA component of telomerase has been placed on deposit 
as pGRN33 (ATCC 75926). Genomic sequences encoding the RNA component of human 
telomerase are also on deposit in the -15 kb SauIIIAl to Hindlll insert of lambda clone 28-1 
(ATCC 75925). For expression in eukaryotic cells the hTRT sequence will typically be 

1 0 operably linked to a transcription initiation sequence (RNA polymerase binding site) and 
transcription terminator sequences (see, e.g., PCT Publication WO 96/01835; Feng et al., 

1995, Science 269:1236). 

The present invention further provides recombinantly produced or 
substantially purified hTRT polypeptides coexpressed and/or associated with so-called 

1 5 "telomerase-associated proteins." Thus, the present invention provides hTRT coexpressed 
with, or complexed with, other proteins (e.g., telomerase-associated proteins). Telomerase- 
associated proteins are those proteins that copurify with human telomerase and/or that may 
play a role in modulating telomerase function or activity, for example by participating in the 
association of telomerase with telomeric DNA. Examples of telomerase-associated proteins 

20 include (but are not limited to) the following proteins and/or their human homologs: 

nucleolin (see, Srivastava et al., 1989, FEBS Letts. 250:99); EF2H (elongation factor 2 
homolog; see Nomura et al. 1994, DNA Res. (Japan) 1:27, GENBANK accession #D21 163); 
TP1/TLP1 (Harrington et al., 1997, Science 275:973; Nakayama, 1997, Cell 88:875); the 
human homologue of the Tetrahymena p95 or p95 itself (Collins et al., 1995, Cell 81:677); 

25 TPC2 (a telomere length regulatory protein; ATCC accession number 97708; TPC3 (also a 
telomere length regulatory protein; ATCC accession number 97707; DNA-binding protein B 
(dbpB; Horwitz et al., 1994, J. Biol. Chem. 269:14130; and Telomere Repeat Binding Factors 
(TRF 1 & 2; Chang et al., 1995, Science 270:1663; Chong et al., 1997, Hum Mol Genet 6:69); 
EST1, 3 and 4 (Lendvay et al., 1996, Genetics 144:1399, Nugent et al., 1996, Science 

30 274:249, Lundblad et al., 1989, Cell 57:633); and End-capping factor (Cardenas et al., 1993, 
Genes Dev. 7:883). 
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Telomerase associated proteins can be identified on the basis of co-purification 
with, or binding to, hTRT protein or the hTRT-hTR RNP. Alternatively, they can be 
identified on the basis of binding to an hTRT fusion protein, e.g., a GST-hTRT fusion protein 
or the like, as determined by affinity purification (see, Ausubel et al. Ch 20). A particularly 

5 useful technique for assessing protein-protein interactions, which is applicable to identifying 
hTRT-associated proteins, is the two hybrid screen method of Chien et al. {Proc. Natl. Acad. 
Sci. USA 88:9578 [1991]; see also Ausubel et al., supra, at Ch. 20). This screen identifies 
protein-protein interactions in vivo through reconstitution of a transcriptional activator, the 
yeast Gal4 transcription protein (see, Fields and Song, 1989, Nature 340:245. The method is 

1 0 based on the properties of the yeast Gal4 protein, which consists of separable domains 
responsible for DNA-binding and transcriptional activation. Polynucleotides, usually 
expression vectors, encoding two hybrid proteins are constructed. One polynucleotide 
comprises the yeast Gal4 DNA-binding domain fused to a polypeptide sequence of a protein 
to be tested for an hTRT interaction (e.g., nucleolin or EF2H). Alternatively the yeast Gal4 

1 5 DNA-binding domain is fused to cDNAs from a human cell, thus creating a library of human 
proteins fused to the Gal4 DNA binding domain for screening for telomerase associated 
proteins. The other polynucleotide comprises the Gal4 activation domain fused to an hTRT 
polypeptide sequence. The constructs are introduced into a yeast host cell. Upon expression, 
intermolecular binding between hTRT and the test protein can reconstitute the Gal4 DNA- 

20 binding domain with the Gal4 activation domain. This leads to the transcriptional activation 
of a reporter gene (e.g., lacZ, HIS3) operably linked to a Gal4 binding site. By selecting for, 
or by assaying the reporter, gene colonies of cells that contain an hTRT interacting protein or 
telomerase associated protein can be identified. Those of skill will appreciate that there are 
numerous variations of the 2-hybrid screen, e.g., the LexA system (Bartel et al, 1993, in 

25 Cellular Interactions in Development: A Practical Approach Ed. Hartley, D.A. (Oxford Univ. 

Press) pp. 153-79). 

Another useful method for identifying telomerase-associated proteins is a 
three-hybrid system (see, e.g., Zhang et al., 1996, Anal. Biochem. 242:68; Licitra et al., 1996, 
Proc. Natl. Acad. Sci. USA 93:12817). The telomerase RNA component can be utilized in 
30 this system with the TRT or hTRT protein and a test protein. Another useful method for 

identifying interacting proteins, particularly (i.e., proteins that heterodimerize or form higher 
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order heteromultimers), is the E. coWBCCP interactive screening system (see, Germino et al. 
(1993) Proc. Natl Acad Set U.S.A. 90:933; Guarente (1993) Proc. Natl. Acad. Sci. (U.S.A.) 
90:1639). 

The present invention also provides complexes of telomere binding proteins 
5 (which may or may not be telomerase associated proteins) and hTRT (which may or may not 
be complexed with hTR, other RNAs, or one or more telomerase associated proteins). 
Examples of telomere binding proteins include TRF1 and TRF2 (supra); rnpAl, rnpA2, 
RAP1 (Buchman et al., 1988, Mol Cell. Biol 8:210, Buchman et al., 1988, Mol Cell. Biol 
8:5086), SIR3 and SIR4 (Aparicio et al, 1991, Cell 66:1279), TEL1 (Greenwell et al., 1995, 
10 Cell 82:823; Morrow et al., 1995, Cell 82:831); ATM (Savitsky et al., 1995, Science 

268:1749), end-capping factor (Cardenas et al., 1993, Genes Dev. 7:883), and corresponding 
human homologs. The aforementioned complexes may be produced generally as described 
supra for complexes of hTRT and hTR or telomerase associated proteins, e.g., by mixing or 
co-expression in vitro or in vivo. 

15 

V. ANTIBODIES AND OTHER BINDING AGENTS 

In a related aspect, the present invention provides antibodies that are 
specifically immunoreactive with hTRT, including polyclonal and monoclonal antibodies, 
antibody fragments, single chain antibodies, human and chimeric antibodies, including 

20 antibodies or antibody fragments fused to phage coat or cell surface proteins, and others 
known in the art and described herein. The antibodies of the invention can specifically 
recognize and bind polypeptides that have an amino acid sequence that is substantially 
identical to the amino acid sequence set forth in Figure 17 (SEQUENCE ID NO: 2), or an 
immunogenic fragment thereof or epitope on the protein defined thereby. The antibodies of 

25 the invention can exhibit a specific binding affinity for hTRT of at least about 10 7 , 10 8 , 10 9 , or 
10 10 M" 1 , and may be polyclonal, monoclonal, recombinant or otherwise produced. The 
invention also provides anti-hTRT antibodies that recognize an hTRT conformational epitope 
(e.g., an epitope on the surface of the hTRT protein or a telomerase RNP). Likely 
conformational epitopes can be identified, if desired, by computer-assisted analysis of the 

30 hTRT protein sequence, comparison to the conformation of related reverse transcriptases, 

such as the p66 subunit of HIV- 1 (see, e.g., Figure 3), or empirically. Anti-hTRT antibodies 
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that recognize conformational epitopes have utility, inter alia, in detection and purification of 
human telomerase and in the diagnosis and treatment of human disease. 

For the production of anti-hTRT antibodies, hosts such as goats, sheep, cows, 
guinea pigs, rabbits, rats, or mice, may be immunized by injection with hTRT protein or any 
5 portion, fragment or oligopeptide thereof which retains immunogenic properties. In selecting 
hTRT polypeptides for antibody induction, one need not retain biological activity; however, 
the protein fragment, or oligopeptide must be immunogenic, and preferably antigenic. 
Immunogenicity can be determined by injecting a polypeptide and adjuvant into an animal 
(e.g., a rabbit) and assaying for the appearance of antibodies directed against the injected 

10 polypeptide (see, e.g., Harlow and Lane, Antibodies: A Laboratory Manual, Cold 

SPRING HARBOR Laboratory, New York (1988), which is incorporated in its entirety and 
for all purposes, e.g., at Chapter 5). Peptides used to induce specific antibodies typically have 
an amino acid sequence consisting of at least five amino acids, preferably at least 8 amino 
acids, more preferably at least 10 amino acids. Usually they will mimic or have substantial 

1 5 sequence identity to all or a contiguous portion of the amino acid sequence of the protein of 
SEQUENCE ID NO: 2. Short stretches of hTRT protein amino acids may be fused with 
those of another protein, such as keyhole limpet hemocyanin, and an anti-hTRT antibody 
produced against the chimeric molecule. Depending on the host species, various adjuvants 
may be used to increase immunological response. 

20 The antigen is presented to the immune system in a fashion determined by 

methods appropriate for the animal. These and other parameters are generally well known to 
immunologists. Typically, injections are given in the footpads, intramuscularly, 
intradermally, perilymph nodally or intraperitoneally. The immunoglobulins produced by the 
host can be precipitated, isolated and purified by routine methods, including affinity 

25 purification. 

Illustrative examples of immunogenic hTRT peptides include are provided in 
Example 8. In addition, Example 8 describes the production and use of anti-hTRT polyclonal 
antibodies* 

30 A) MONOCLONAL ANTIBODIES 

Monoclonal antibodies to hTRT proteins and peptides may be prepared in 
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accordance with the methods of the invention using any technique which provides for the 
production of antibody molecules by continuous cell lines in culture. These include, but are 
not limited to, the hybridoma technique originally described by Koehler and Milstein {Nature 
256:495 [1975]), the human B-cell hybridoma technique (Kosbor et al., 1983, Immunol 
Today 4:72; Cote et al., 1983, Proc. Natl Acad. Set USA, 80:2026), and the EBV-hybridoma 
technique (Cole et al., Monoclonal Antibodies and Cancer Therapy, Alan R Liss Inc, 
New York NY, pp 77-96 [1985]). 

In one embodiment, appropriate animals are selected and the appropriate 
immunization protocol followed. The production of non-human monoclonal antibodies, e.g., 
murine, lagomorpha, equine, is well known and can be accomplished by, for example, 
immunizing an animal with a preparation containing hTRT or fragments thereof. In one 
method, after the appropriate period of time, the spleens of the animals are excised and 
individual spleen cells are fused, typically, to immortalized myeloma cells under appropriate 
selection conditions. Thereafter, the cells are clonally separated and the supernatants of each 
clone (e.g., hybridoma) are tested for the production of an appropriate antibody specific for 
the desired region of the antigen. Techniques for producing antibodies are well known in the 
art. See, e.g., Goding et al, Monoclonal Antibodies: Principles and Practice (2d ed.) 
Acad. Press, N.Y., and Harlow and Lane, supra, each of which is incorporated in its entirety 
and for all purposes. Other suitable techniques involve the in vitro exposure of lymphocytes 
to the antigenic polypeptides or alternatively, to selection of libraries of antibodies in phage 
or similar vectors (see, infra). 

B) HUMAN ANTIBODIES 

In another aspect of the invention, human antibodies against an hTRT 
polypeptide are provided. Human monoclonal antibodies against a known antigen can also be 
made using transgenic animals having elements of a human immune system {see, e.g., U.S. 
Patent Nos. 5,569,825 and 5,545,806, both of which are incorporated by reference in their 
entirety for all purposes) or using human peripheral blood cells (Casali et al., 1986, Science 
234:476). Some human antibodies are selected by competitive binding experiments, or 
otherwise, to have the same epitope specificity as a particular mouse antibody. 

In an alternative embodiment, human antibodies to an hTRT polypeptide can 
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be produced by screening a DNA library from human B cells according to the general 
protocol outlined by Huse et al., 1989, Science 246:1275, which is incorporated by reference. 
Antibodies binding to the hTRT polypeptide are selected. Sequences encoding such 
antibodies (or binding fragments) are then cloned and amplified. The protocol described by 
5 Huse is often used with phage-display technology. 

C) HUMANIZED OR CHIMERIC ANTIBODIES 

The invention also provides anti-hTRT antibodies that are made chimeric, 
human-like or humanized, to reduce their potential antigenicity, without reducing their 

1 0 affinity for their target. Preparation of chimeric, human-like and humanized antibodies have 
been described in the art (see, e.g., U.S. Patent Nos. 5,585,089 and 5,530,101; Queen, et al, 
1989, Proc. Nat'lAcad. Set USA 86:10029; and Verhoeyan et al., 1988, Science 239:1534; 
each of which is incorporated by reference in their entirety and for all purposes). Humanized 
immunoglobulins have variable framework regions substantially from a human 

1 5 immunoglobulin (termed an acceptor immunoglobulin) and complementarity determining 
regions substantially from a non-human (e.g., mouse) immunoglobulin (referred to as the 
donor immunoglobulin). The constant region(s), if present, are also substantially from a 
human immunoglobulin. 

In some applications, such as administration to human patients, the humanized 

20 (as well as human) anti-hTRT antibodies of the present invention offer several advantages 
over antibodies from murine or other species: (1) the human immune system should not 
recognize the framework or constant region of the humanized antibody as foreign, and 
therefore the antibody response against such an injected antibody should be less than against 
a totally foreign mouse antibody or a partially foreign chimeric antibody; (2) because the 

25 effector portion of the humanized antibody is human, it may interact better with other parts of 
the human immune system; and (3) injected humanized antibodies have a half-life essentially 
equivalent to naturally occurring human antibodies, allowing smaller and less frequent doses 
than antibodies of other species. As implicit from the foregoing, anti hTRT antibodies have 
application in the treatment of disease, i.e., to target telomerase-positive cells. 
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D) PHAGE DISPLAY 

The present invention also provides anti-hTRT antibodies (or binding 
compositions) produced by phage display methods (see, e.g., Dower et al., WO 91/17271 and 
McCafferty et al., WO 92/01047; and Vaughan et al, 1996, Nature Biotechnology, 14: 309; 
each of which is incorporated by reference in its entirety for all purposes). In these methods, 
libraries of phage are produced in which members display different antibodies on their outer 
surfaces. Antibodies are usually displayed as Fv or Fab fragments. Phage displaying 
antibodies with a desired specificity are selected by affinity enrichment to an hTRT 
polypeptide. 

In a variation of the phage-display method, humanized antibodies having the 
binding specificity of a selected murine antibody can be produced. In this method, either the 
heavy or light chain variable region of the selected murine antibody is used as a starting 
material. If, for example, a light chain variable region is selected as the starting material, a 
phage library is constructed in which members display the same light chain variable region 
(i.e., the murine starting material) and a different heavy chain variable region. The heavy 
chain variable regions are obtained from a library of rearranged human heavy chain variable 
regions. A phage showing strong specific binding for the hTRT polypeptide (e.g., at least 10 s 
and preferably at least 10 9 M" 1 ) is selected. The human heavy chain variable region from this 
phage then serves as a starting material for constructing a further phage library. In this 
library, each phage displays the same heavy chain variable region (i.e., the region identified 
from the first display library) and a different light chain variable region. The light chain 
variable regions are obtained from a library of rearranged human variable light chain regions. 
Again, phage showing strong specific binding are selected. These phage display the variable 
regions of completely human anti-hTRT antibodies. These antibodies usually have the same 
or similar epitope specificity as the murine starting material. 

E) HYBRID ANTIBODIES 

The invention also provides hybrid antibodies that share the specificity of 
antibodies against an hTRT polypeptide but are also capable of specific binding to a second 
moiety. In such hybrid antibodies, one heavy and light chain pair is usually from an anti- 
hTRT antibody and the other pair from an antibody raised against another epitope or protein. 



91 



• 



This results in the property of multi-functional valency, i.e., ability to bind at least two 
different epitopes simultaneously, where at least one epitope is the epitope to which the anti- 
complex antibody binds. Such hybrids can be formed by fusion of hybridomas producing the 
respective component antibodies, or by recombinant techniques. Such hybrids can be used to 
5 carry a compound (i.e., drug) to a telomerase-positive cell (i.e., a cytotoxic agent is delivered 
to a cancer cell). 

Immunoglobulins of the present invention can also be fused to functional 
regions from other genes (e.g., enzymes) to produce fusion proteins (e.g., immunotoxins) 
having useful properties. 

10 

F) ANTI-IDIOTYPIC ANTIBODIES 

Also useful are anti-idiotype antibodies which can be isolated by the above 
procedures. Anti-idiotypic antibodies may be prepared by, for example, immunization of an 
animal with the primary antibody (i.e., anti-hTRT antibodies or hTRT-binding fragments 

1 5 thereof). For anti-hTRT antibodies, anti-idiotype antibodies whose binding to the primary 
antibody is inhibited by an hTRT polypeptide or fragments thereof are selected. Because 
both the anti-idiotypic antibody and the hTRT polypeptide or fragments thereof bind the 
primary immunoglobulin, the anti-idiotypic immunoglobulin can represent the "internal 
image" of an epitope and thus can substitute for the hTRT polypeptide in assays or can be 

20 used to bind (i.e., inactivate) anti-hTRT antibodies, e.g., in a patient. Anti-idiotype antibodies 
can also interact with telomerase associated proteins. Administration of such antibodies can 
affect telomerase function by titrating out or competing with hTRT in binding to hTRT- 
associated proteins. 



25 G) GENERAL 

The antibodies of the invention may be of any isotype, e.g., IgM, IgD, IgG, 
IgA, and IgE, with IgG, IgA and IgM often preferred. Humanized antibodies may comprise 
sequences from more than one class or isotype. 

In another embodiment of the invention, fragments of the intact antibodies 
30 described above are provided. Typically, these fragments can compete with the intact 

antibody from which they were derived for specific binding to the hTRT polypeptide, and 
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bind with an affinity of at least 10 7 , 10 8 - 10 9 M" 1 , or 10 10 M" 1 . Antibody fragments include 
separate heavy chains, light chains, Fab, Fab' F(ab% Fabc, and Fv. Fragments can be 
produced by eirzymatic or chemical separation of intact immunoglobulins. For example, a 
F(ab') 2 fragment can be obtained from an IgG molecule by proteolytic digestion with pepsin 
at pH 3.0-3.5 using standard methods such as those described in Harlow and Lane, supra. 
Fab fragments may be obtained from F(ab') 2 fragments by limited reduction, or from whole 
antibody by digestion with papain in the presence of reducing agents {see generally, Paul, W., 
ed Fundamental Immunology 2nd Raven Press, N.Y., 1989, Ch. 7, incorporated by 
reference in its entirety for all purposes). Fragments can also be produced by recombinant 
DNA techniques. Segments of nucleic acids encoding selected fragments are produced by 
digestion of full-length coding sequences with restriction enzymes, or by de novo synthesis. 
Often fragments are expressed in the form of phage-coat fusion proteins. 

Many of the immunoglobulins described above can undergo non-critical 
amino-acid substitutions, additions or deletions in both the variable and constant regions 
without loss of binding specificity or effector functions, or intolerable reduction of binding 
affinity (i.e., below about 10 7 M" 1 ). Usually, immunoglobulins incorporating such alterations 
exhibit substantial sequence identity to a reference immunoglobulin from which they were 
derived. A mutated immunoglobulin can be selected having the same specificity and 
increased affinity compared with a reference immunoglobulin from which it was derived. 
Phage-display technology offers useful techniques for selecting such immunoglobulins. See, 
e.g., Dower et al., WO 91/17271 McCafferty et al., WO 92/01047; and Huse, WO 92/06204. 

The antibodies of the present invention can be used with or without 
modification. Frequently, the antibodies will be labeled by joining, either covalently or non- 
covalently, a detectable label. As labeled binding entities, the antibodies of the invention are 
particularly useful in diagnostic applications. 

The anti-hTRT antibodies of the invention can be purified using well known 
methods. The whole antibodies, their dimers, individual light and heavy chains, or other 
immunoglobulin forms of the present invention can be purified using the methods and 
reagents of the present invention in accordance with standard procedures of the art, including 
ammonium sulfate precipitation, affinity columns, column chromatography, gel 
electrophoresis and the like {see generally Scopes, Protein Purification: Principles and 
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Practice 3rd Edition (Springer-Verlag, N.Y., 1994)). Substantially pure immunoglobulins 
of at least about 90 to 95%, or even 98 to 99% or more homogeneity are preferred. 

VI. PURIFICATION OF HUMAN TELOMERASE 

5 The present invention provides isolated human telomerase of unprecedented 

purity. In particular, the present invention provides: purified hTRT of recombinant or 
nonrecombinant origin; purified hTRT-hTR complexes (i.e., RNPs) of recombinant, 
nonrecombinant, or mixed origin, optionally comprising one or more telomerase-associated 
proteins; purified naturally occurring human telomerase; and the like. Moreover, the 

10 invention provides methods and reagents for partially, substantially or highly purifying the 
above-molecules and complexes, including variants, fusion proteins, naturally occurring 
proteins, and the like (collectively referred to as "hTRT and/or hTRT complexes"). 

Prior to the present disclosure, attempts had been made to purify the 
telomerase enzyme complex to homogeneity had met with limited success. The methods 

1 5 provided in the aforelisted applications provide purification of telomerase by approximately 
up to 60,000-fold or more compared to crude cell extracts. The present invention provides 
hTRT and hTRT complexes of even greater purity, in part by virtue of the novel 
immunoaffinity reagents (e.g., anti-hTRT antibodies) of the present invention, and/or the 
reagents, cells, and methods provided herein for recombinant expression of hTRT. 

20 Recombinant expression of hTRT and hTRT complexes facilitates purification because the 
desired molecules can be produced at much higher levels than found in most expressing cells 
occurring in nature, and/or because the recombinant hTRT molecule can be modified (e.g., by 
fusion with an epitope tag) such that it may be easily purified. 

It will be recognized that naturally occurring telomerase can be purified from 

25 any telomerase-positive cell, and recombinant hTRT and hTRT complexes can be expressed 
and purified, inter alia, using any of the in vitro, in vivo, ex vivo, or plant or animal 
expression systems disclosed supra, or others/sy stems known in the art. 

In one embodiment, the hTRT, telomerase and other compositions of the 
invention are purified using an immunoaffinity step, alone or in combination with other 

30 purification steps. Typically, an immobilized or immobilizable anti-hTRT antibody, as 
provided by the present invention, is contacted with a sample, such as a cell lysate, that 
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contains the desired hTRT or hTRT-containing complex under conditions in which anti- 
hTRT antibody binds the hTRT antigen. After removal of the unbound components of the 
sample by methods well known in the art, the hTRT composition may be eluted, if desired, 
from the antibody, in substantially pure form. In one embodiment, immunoaffinity 
5 chromatography methods well known in the art are used (see, e.g., Harlow and Lane, supra; 
and Ausubel, supra; Hermansan et al., 1992, Immobilized Affinity Ligand Techniques 
(Academic Press, San Diego)) in accordance with the methods of the invention. In another 
illustrative embodiment, immunoprecipitation of anti-hTRT-immunoglobulin-hTRT 
complexes is carried out using immobilized Protein A. Numerous variations and alternative 
10 immunoaffinity purification protocols suitable for use in accordance with the methods and 
reagents of the invention are well-known to those of skill. 

In another embodiment, recombinant hTRT proteins can, as a consequence of 
their high level of expression, be purified using routine protein purification methods, such as 
ammonium sulfate precipitation, affinity columns (e.g., immunoaffinity), size-exclusion, 
15 anion and cation exchange chromatography, gel electrophoresis and the like (see, generally, 
R. Scopes, Protein Purification, Springer-Verlag, N.Y. (1982) and Deutscher, Methods 
in Enzymology Vol. 182: Guide to Protein Purification, Academic Press, Inc. N.Y. 
(1990)) instead of, or in addition to, immunoaffinity methods. Cation exchange methods can 
be particularly useful due to the basic pi of the hTRT protein. For example, immobilized 
20 phosphate may be used as a cation exchange functional group (e.g., P-l 1 Phosphocellulose, 
Whatman catalog #4071 or Cellulose Phosphate, Sigma catalog #C 3145). Immobilized 
phosphate has two advantageous features for hTRT purification - it is a cation exchange resin, 
and it shows physical resemblance to the phosphate backbone of nucleic acid. This can allow 
for affinity chromatography because hTRT binds hTR and telomeric DNA. Other non- 
25 specific and specific nucleic acid affinity chromatography methods are also useful for 

purification (e.g., Alberts etaL, 1971, Methods Enzymol 21:198; Amt-Jovin et al., 1975, 
Eur. J. Biochem. 54:411; Pharmacia catalog #27-5575-02). Further exploitation of this 
binding function of hTRT could include the use of specific nucleic acid (e.g., telomerase 
primer or hTR) affinity chromatography for purification (Chodosh et al., 1986, Mol Cell 
30 Biol 6:4723; Wu et al., 1987, Science 238:1247; Kadonaga, 1991, Methods Enzymol 

208:10); immobilized Cibricon Blue Dye, which shows physical resemblance to nucleotides, 
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is another useful resin for hTRT purification (Pharmacia catalog #17-0948-01 or Sigma 
catalog #C 1285), due to hTRT binding of nucleotides (e.g., as substrates for DNA synthesis). 

In one embodiment, hTRT proteins are isolated directly from an in vitro or in 
vivo expression system in which other telomerase components are not coexpressed. It will be 
5 recognized that isolated hTRT protein may also be readily obtained from purified human 
telomerase or hTRT complexes, for example, by disrupting the telomerase RNP (e.g., by 
exposure to a mild or other denaturant) and separating the RNP components (e.g., by routine 
means such as chromatography or immunoaffinity chromatography). 

Telomerase purification may be monitored using a telomerase activity assay 

10 (e.g., the TRAP assay, conventional assay, or primer-binding assay), by measuring the 
enrichment of hTRT (e.g., by ELISA), by measuring the enrichment of hTR, or other 
methods known in the art. 

The purified human telomerase, hTRT proteins, and hTRT complexes 
provided by the present invention are, in one embodiment, highly purified (i.e., at least about 

1 5 90% homogeneous, more often at least about 95% homogeneous). Homogeneity can be 
determined by standard means such as SDS-polyacrylamide gel electrophoresis and other 
means known in the art (see, e.g., Ausubel et al, supra). It will be understood that, although 
highly purified human telomerase, hTRT protein, or hTRT complexes are sometimes desired, 
substantially purified (e.g., at least about 75% homogeneous) or partially purified (e.g., at 

20 least about 20% homogeneous) human telomerase, hTRT protein, or hTRT complexes are 
useful in many applications, and are also provided by the present invention. For example, 
partially purified telomerase is useful for screening test compounds for telomerase 
modulatory activity, and other uses (see, infra and supra; see U.S. Patent No. 5,645,986). 

25 VII. TREATMENT OF TELOMERASE-RELATED DISEASE 
A) INTRODUCTION 

The present invention provides hTRT polynucleotides, polypeptides, and 
antibodies useful for the treatment of human diseases and disease conditions. The 
recombinant and synthetic hTRT gene products (protein and mRNA) of the invention can be 
30 used to create or elevate telomerase activity in a cell, as well as to inhibit telomerase activity 
in cells in which it is not desired. Thus, inhibiting, activating or otherwise altering a 
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telomerase activity (e.g., telomerase catalytic activity, fidelity, processivity, telomere binding, 
etc.) in a cell can be used to change the proliferative capacity of the cell. For example, 
reduction of telomerase activity in an immortal cell, such as a malignant tumor cell, can 
render the cell mortal. Conversely, increasing the telomerase activity in a mortal cell (e.g., 
5 most human somatic cells) can increase the proliferative capacity of the cell. For example, 
expression of hTRT protein in dermal fibroblasts, thereby increasing telomere length, will 
result in increased fibroblast proliferative capacity; such expression can slow or reverse the 
age-dependent slowing of wound closure (see, e.g., West, 1994, Arch Derm, 130:87). 

Thus, in one aspect, the present invention provides reagents and methods 
10 useful for treating diseases and conditions characterized by the presence, absence, or amount 
of human telomerase activity in a cell and that are susceptible to treatment using the 
compositions and methods disclosed herein. These diseases include, as described more folly 
below, cancers, other diseases of cell proliferation (particularly diseases of aging), 
immunological disorders, infertility (or fertility), and others. 

15 

B) TREATMENT OF CANCER 

The present invention provides methods and compositions for reducing 
telomerase activity in tumor cells and for treating cancer. Compositions include antisense 
oligonucleotides, peptides, gene therapy vectors encoding antisense oligonucleotides or 

20 activity altering proteins, and anti-hTRT antibodies. Cancer cells (e.g., malignant tumor 
cells) that express telomerase activity (telomerase-positive cells) can be mortalized by 
decreasing or inhibiting the endogenous telomerase activity. Moreover, because telomerase 
levels correlate with disease characteristics such as metastatic potential (e.g., U.S. Patent No. 
5,639,613; 5,648,215; 5,489,508; Pandita et al., 1996, Proc. Am, Ass. Cancer Res. 37:559), 

25 any reduction in telomerase activity could reduce the aggressive nature of a cancer to a more 
manageable disease state (increasing the efficacy of traditional interventions). 

The invention provides compositions and methods useful for treatment of 
cancers of any of a wide variety of types, including solid tumors and leukemias. Types of 
cancer that may be treated include (but are not limited to): adenocarcinoma of the breast, 

30 prostate, and colon; all forms of bronchogenic carcinoma of the lung; myeloid; melanoma; 
hepatoma; neuroblastoma; papilloma; apudoma; choristoma; branchioma; malignant 
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carcinoid syndrome; carcinoid heart disease; carcinoma (e.g., Walker, basal cell, 

basosquamous, Brown-Pearce, ductal, Ehrlich tumor, in situ, Krebs 2, merkel cell, mucinous, 

non-small cell lung, oat cell, papillary, scirrhous, bronchiolar, bronchogenic, squamous cell, 

and transitional cell), histiocytic disorders; leukemia (e.g., B-cell, mixed-cell, null-cell, 
5 T-cell, T-cell chronic, HTLV-II-associated, lyphocytic acute, lymphocytic chronic, mast-cell, 

and myeloid); histiocytosis malignant; Hodgkin ? s disease; immunoproliferative small; 

non-Hodgkin f s lymphoma; plasmacytoma; reticuloendotheliosis; melanoma; 

chondroblastoma; chondroma; chondrosarcoma; fibroma; fibrosarcoma; giant cell tumors; 

histiocytoma; lipoma; liposarcoma; mesothelioma; myxoma; myxosarcoma; osteoma; 
10 osteosarcoma; Ewing's sarcoma; synovioma; adenofibroma; adenolymphoma; 

carcinosarcoma; chordoma; craniopharyngioma; dysgerminoma; hamartoma; 

mesenchymoma; mesonephroma; myosarcoma; ameloblastoma; cementoma; odontoma; 

teratoma; thymoma; trophoblastic tumor; adenocarcinoma; adenoma; cholangioma; 

cholesteatoma; cylindroma; cystadenocarcinoma; cystadenoma; granulosa cell tumor; 
15 gynandroblastoma; hepatoma; hidradenoma; islet cell tumor; leydig cell tumor; papilloma; 

Sertoli cell tumor; theca cell tumor; leiomyoma; leiomyosarcoma; myoblastoma; myoma; 

myosarcoma; rhabdomyoma; rhabdomyosarcoma; ependymoma; ganglioneuroma; glioma; 

medulloblastoma; meningioma; neurilemmoma; neuroblastoma; neuroepithelioma; 

neurofibroma; neuroma; paraganglioma; paraganglioma nonchromaffin; angiokeratoma; 
20 angiolymphoid hyperplasia with eosinophilia; angioma sclerosing; angiomatosis; 

glomangioma; hemangioendothelioma; hemangioma; hemangiopericytoma; 

hemangiosarcoma; lymphangioma; lymphangiomyoma; lymphangiosarcoma; pinealoma; 

carcinosarcoma; chondrosarcoma; cystosarcoma phyllodes; fibrosarcoma; hemangiosarcoma; 

leiomyosarcoma; leukosarcoma; liposarcoma; lymphangiosarcoma; myosarcoma; 
25 myxosarcoma; ovarian carcinoma; rhabdomyosarcoma; sarcoma (e.g., Ewing's, experimental, 

Kaposi's, and mast-cell); neoplasms (e.g., bone, breast, digestive system, colorectal, liver, 

pancreatic, pituitary, testicular, orbital, head and neck, central nervous system, acoustic, 

pelvic, respiratory tract, and urogenital); neurofibromatosis, and cervical dysplasia). The 

invention provides compositions and methods useful for treatment of other conditions in 
30 which cells have become immortalized or hyperproliferative, e.g., by disregulation (e.g., 

abnormally high expression) of hTRT, telomerase enzyme, or telomerase activity. 
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The present invention further provides compositions and methods for 
prevention of cancers, including anti-hTRT vaccines, gene therapy vectors that prevent 
telomerase activation, and gene therapy vectors that result in specific death of telomerase- 
positive cells. In a related aspect, the gene replacement therapy methods described below 
5 may be used for "treating" a genetic predilection for cancers. 

C) TREATMENT OF OTHER CONDITIONS 

The present invention also provides compositions and methods useful for 
treatment of diseases and disease conditions (in addition to cancers) characterized by under- 

1 0 or over-expression of telomerase or hTRT gene products. Examples include: diseases of cell 
proliferation, diseases resulting from cell senescence (particularly diseases of aging), 
immunological disorders, infertility, diseases of immune dysfunction, and others. 

Certain diseases of aging are characterized by cell senescence-associated 
changes due to reduced telomere length (compared to younger cells), resulting from the 

15 absence (or much lower levels) of telomerase activity in the cell. Decreased telomere length 
and decreased replicative capacity contribute to diseases such as those described below. 
Telomerase activity and telomere length can be increased by, for example, increasing levels 
of hTRT gene products (protein and mRNA) in the cell. A partial listing of conditions 
associated with cellular senescence in which hTRT expression can be therapeutic includes 

20 Alzheimer's disease, Parkinson's disease, Huntington's disease, and stroke; age-related 

diseases of the integument such as dermal atrophy, elastolysis and skin wrinkling, sebaceous 
gland hyperplasia, senile lentigo, graying of hair and hair loss, chronic skin ulcers, and 
age-related impairment of wound healing; degenerative joint disease; osteoporosis; 
age-related immune system impairment (e.g., involving cells such as B and T lymphocytes, 

25 monocytes, neutrophils, eosinophils, basophils, NK cells and their respective progenitors); 
age-related diseases of the vascular system including atherosclerosis, calcification, 
thrombosis, and aneurysms; diabetes, muscle atrophy, respiratory diseases, diseases of the 
liver and GI tract, metabolic diseases, endocrine diseases (e.g., disorders of the pituitary and 
adrenal gland), reproductive diseases, and age-related macular degeneration. These diseases 

30 and conditions can be treated by increasing the levels of hTRT gene products in the cell to 
increase telomere length, thereby restoring or imparting greater replicative capacity to the 



99 



# 



cell. Such methods can be carried out on cells cultured ex vivo or cells in vivo. In one 
embodiment, the cells are first treated to activate telomerase and lengthen telomeres, and then 
treated to inactivate the hTRT gene and telomerase activity. In a preferred embodiment, 
telomerase activity is generated by a vector of the invention in an embryonic germ or stem 

5 cell prior to or during differentiation. 

The present invention also provides methods and composition useful for 
treating infertility. Human germline cells (e.g., spermatogonia cells, their progenitors or 
descendants) are capable of indefinite proliferation and characterized by high telomerase 
activity. Abnormal or diminished levels of hTRT gene products can result, for example, in 

1 0 inadequate or abnormal production of spermatozoa, leading to infertility or disorders of 

reproduction. Accordingly, "telomerase-based" infertility can be treated using the methods 
and compositions described herein to increase telomerase levels. Similarly, because 
inhibition of telomerase may negatively impact spermatogenesis, oogenesis, and sperm and 
egg viability, the telomerase inhibitory compositions of the invention can have contraceptive 

15 effects when used to reduce hTRT gene product levels in germline cells. 

Further, the invention provides methods and composition useful for decreasing 
the proliferative potential of telomerase-positive cells such as activated lymphocytes and 
hematopoietic stem cells by reducing telomerase activity. Thus, the invention provide means 
for effecting immunosuppression. Conversely, the methods and reagents of the invention are 

20 useful for increasing telomerase activity and proliferative potential in cells, such as stem 

cells, that express a low level of telomerase or no telomerase prior to therapeutic intervention. 

D) MODES OF INTERVENTION 

As is clear from the foregoing discussion, modulation of the level of 
25 telomerase or telomerase activity of a cell can have a profound effect on the proliferative 
potential of the cell, and so has great utility in treatment of disease. As is also clear, this 
modulation may be either a decrease in telomerase activity or an increase in activity. The 
telomerase modulatory molecules of the invention can act through a number of mechanisms; 
some of these are described in this and the following subsections to aid the practitioner in 
30 selecting therapeutic agents. However, applicants do not intend to be limited to any particular 
mechanism of action for the novel therapeutic compounds, compositions and methods 
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described herein. 

Telomerase activity may be decreased through any of several mechanisms or 
combinations of mechanisms. One mechanism is the reduction of hTRT gene expression to 
reduce telomerase activity. This reduction can be at the level of transcription of the hTRT 
gene into mRNA, processing (e.g., splicing), nuclear transport or stability of mRNA, 
translation of mRNA to produce hTRT protein, or stability and function of hTRT protein. 
Another mechanism is interference with one or more activities of telomerase (e.g., the reverse 
transcriptase catalytic activity, or the hTR-binding activity) using inhibitory nucleic acids, 
polypeptides, or other agents (e.g., mimetics, small molecules, drugs and pro-drugs) that can 
be identified using the methods, or are provided by compositions, disclosed herein. Other 
mechanisms include sequestration of hTR and/or telomerase associated proteins, and 
interference with the assembly of the telomerase RNP from its component subunits. In a 
related mechanism, an hTRT promoter sequence is operably linked to a gene encoding a toxin 
and introduced into a cell; if or when hTRT transcriptional activators are expressed or 
activated in the cell, the toxin will be expressed, resulting in specific cell killing. 

A related method for reducing the proliferative capacity of a cell involves 
introducing an hTRT variant with low fidelity (i.e., one with a high, e.g., greater than 1%, 
error rate) such that aberrant telomeric repeats are synthesized. These aberrant repeats affect 
telomere protein binding and lead to chromosomal rearrangements and aberrations and/or 
lead to cell death. 

Similarly, telomerase activity may be increased through any of several 
mechanisms, or a combination of mechanisms. These include increasing the amount of hTRT 
in a cell. Usually this is carried out by introducing an hTRT polypeptide-encoding 
polynucleotide into the cell (e.g., a recombinantly produced polypeptide comprising an hTRT 
DNA sequence operably linked to a promoter, or a stable hTRT mRNA). Alternatively, a 
catalytically active hTRT polypeptide can itself be introduced into a cell or tissue, e.g., by 
microinjection or other means known in the art. In other mechanisms, expression from the 
endogenous hTRT gene or the stability of hTRT gene products in the cell can be increased. 
Telomerase activity in a cell can also be increased by interfering with the interaction of 
endogenous telomerase inhibitors and the telomerase RNP, or endogenous hTRT 
transcription repressors and the hTRT gene; by increasing expression or activity of hTRT 
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transcription activators; and other means apparent to those of skill upon review of this 
disclosure. 

E) INTERVENTION AGENTS 
5 1) TRT PROTEINS & PEPTIDES 

In one embodiment, the invention provides telomerase modulatory 
polypeptides (i.e., proteins, polypeptides, and peptides) that increase or reduce telomerase 
activity which can be introduced into a target cell directly (e.g., by injection, liposome- 
mediated fusion, application of a hydrogel to the tumor [e.g., melanoma] surface, fusion or 

1 0 attachment to herpes virus structural protein VP22, and other means described herein and 

known in the art). In a second embodiment, telomerase modulatory proteins and peptides of 
the invention are expressed in a cell by introducing a nucleic acid (e.g., a DNA expression 
vector or mRNA) encoding the desired protein or peptide into the cell. Expression may be 
either constitutive or inducible depending on the vector and choice of promoter (see 

1 5 discussion below). Messenger RNA preparations encoding hTRT are especially useful when 
only transient expression (e.g., transient activation of telomerase) is desired. Methods for 
introduction and expression of nucleic acids into a cell are well known in the art (also, see 
elsewhere in this specification, e.g., sections on oligonucleotides, gene therapy methods). 

In one aspect of the invention, a telomerase modulatory polypeptide that 

20 increases telomerase activity in a cell is provided. In one embodiment, the polypeptide is a 
catalytically active hTRT polypeptide capable of directing the synthesis (in conjunction with 
an RNA template such as hTR) of human telomeric DNA. This activity can be measured, as 
discussed above, e.g., using a telomerase activity assay such as a TRAP assay. In one 
embodiment, the polypeptide is a full-length hTRT protein, having a sequence of, or 

25 substantially identical to, the sequence of 1 1 32 residues of SEQUENCE ID No: 2. In another 
embodiment, the polypeptide is a variant of the hTRT protein of SEQUENCE ID No: 2, such 
as a fusion polypeptide, derivatized polypeptide, truncated polypeptide, conservatively 
substituted polypeptide, activity-modified polypeptide, or the like. A fusion or derivatized 
protein may include a targeting moiety that increases the ability of the polypeptide to traverse 

30 a cell membrane or causes the polypeptide to be delivered to a specified cell type (e.g., liver 
cells or tumor cells) preferentially or cell compartment (e.g., nuclear compartment) 
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preferentially. Examples of targeting moieties include lipid tails, amino acid sequences such 
as antennapoedia peptide or a nuclear localization signal (NLS; e.g.,Xenopus nucleoplasm^ 
Robbins et al., 1991, Cell 64:615). Naturally occurring hTRT protein (e.g., having a 
sequence of, or substantially identical to, SEQUENCE ID NO: 2) acts in the cell nucleus. 

5 Thus, it is likely that one or more subsequences of SEQUENCE ID NO: 2, such as residues 
193-196 (PRRR) and residues 235-240 (PKRPRR) act as a nuclear localization signal. The 
small regions are likely NLSs based on the observation that many NLSs comprise a 4 residue 
pattern composed of basic amino acids (K or R), or composed of three basic amino acids (K 
or R) and H or P; a pattern starting with P and followed within 3 residues by a basic segment 

10 containing 3 K or R residues out of 4 residues (see, e.g., Nakai et al., 1992, Genomics 

14:897). Deletion of one or both of these sequences and/or additional localization sequences 
is expected to interfere with hTRT transport to the nucleus and/or increase hTRT turnover, 
and is useful for preventing access of telomerase to its nuclear substrates and decreasing 
proliferative potential. Moreover, a variant hTRT polypeptide lacking NLS may assemble 

15 into an RNP that will not be able to maintain telomere length, because the resulting enzyme 
cannot enter the nucleus. 

The hTRT polypeptides of the invention will typically be associated in the 
target cell with a telomerase RNA, such as hTR, especially when they are used to increase 
telomerase activity in a cell. In one embodiment, an introduced hTRT polypeptide associates 

20 with an endogenous hTR to form a catalytically active RNP (e.g., an RNP comprising the 
hTR and a full-length polypeptide having a sequence of SEQUENCE ID NO:2). The RNP 
so-formed may also associate with other, e.g., telomerase-associated, proteins. In other 
embodiments, telomerase RNP (containing hTRT protein, hTR and optionally other 
components) is introduced as a complex to the target cell. 

25 In a related embodiment, an hTRT expression vector is introduced into a cell 

(or progeny of a cell) into which a telomerase RNA (e.g., hTR) expression vector is 
simultaneously, subsequently or has been previously introduced. In this embodiment, hTRT 
protein and telomerase RNA are coexpressed in the cell and assemble to form a telomerase 
RNP. A preferred telomerase RNA is hTR. An expression vector useful for expression of 

30 hTR in a cell is described supra (see U.S. Patent 5,583,016). In yet another embodiment, the 
hTRT polypeptide and hTR RNA (or equivalent) are associated in vitro to form a complex, 
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which is then introduced into the target cells, e.g., by liposome mediated transfer. 

In another aspect, the invention provides hTRT polypeptides useful for 
reducing telomerase activity in a cell. As above, these "inhibitory" polypeptides can be 
introduced directly, or by expression of recombinant nucleic acids in the cell. It will be 
recognized that peptide mimetics or polypeptides comprising nonstandard amino acids (i.e., 
other than the 20 amino acids encoded by the genetic code or their normal derivatives) will 
typically be introduced directly. 

In one embodiment, inhibition of telomerase activity results from the 
sequestration of a component required for accurate telomere elongation. Examples of such 
components are hTRT and hTR. Thus, administration of a polypeptide that binds hTR, but 
which does not have telomerase catalytic activity, can reduce endogenous telomerase activity 
in the cell. In a related embodiment, the hTRT polypeptide may bind a cell component other 
than hTR, such as one or more telomerase-associated proteins, thereby interfering with 
telomerase activity in the cell. 

In another embodiment, hTRT polypeptides of the invention interfere (e.g., by 
competition) with the interaction of endogenously expressed hTRT protein and another 
cellular component required for telomerase function, such as hTR, telomeric DNA, 
telomerase-associated proteins, telomere-associated proteins, telomeres, cell cycle control 
proteins, DNA repair enzymes, histone or non-histone chromosomal proteins, or others. 

In selecting molecules (e.g., polypeptides) of the invention that affect the 
interaction of endogenously expressed hTRT protein and other cellular components, one may 
prefer molecules that include one or more of the conserved motifs of the hTRT protein, as 
described herein. The evolutionary conservation of these regions indicates the important 
function in the proper functioning of human telomerase contributed by these motifs, and the 
motifs are thus generally useful sites for changing hTRT protein function to create variant 
hTRT proteins of the invention. Thus, variant hTRT polypeptides having mutations in 
conserved motifs will be particularly useful for some applications of the invention. 

In another embodiment, expression of the endogenous hTRT gene is repressed 
by introduction into the cell of a large amount of hTRT polypeptide (e.g., typically at least 
about 2-fold more than the endogenous level, more often at least about 10- to about 100-fold) 
which acts via a feedback loop to inhibit transcription of the hTRT gene, processing of the 
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hTRT pre-mRNA, translation of the hTRT mRNA, or assembly and transport of the 
telomerase RNP. 

2) OLIGONUCLEOTIDES 
5 a) ANTISENSE CONSTRUCTS 

The invention provides methods and antisense oligonucleotide or 
polynucleotide reagents which can be used to reduce expression of hTRT gene products in 
vitro or in vivo. Administration of the antisense reagents of the invention to a target cell 
results in reduced telomerase activity, and is particularly useful for treatment of diseases 

1 0 characterized by high telomerase activity (e.g., cancers). Without intending to be limited to 
any particular mechanism, it is believed that antisense oligonucleotides bind to, and interfere 
with the translation of, the sense hTRT mRNA. Alternatively, the antisense molecule may 
render the hTRT mRNA susceptible to nuclease digestion, interfere with transcription, 
interfere with processing, localization or otherwise with RNA precursors ("pre-mRNA"), 

1 5 repress transcription of mRNA from the hTRT gene, or act through some other mechanism. 
However, the particular mechanism by which the antisense molecule reduces hTRT 

expression is not critical. 

The antisense polynucleotides of the invention comprise an antisense sequence 
of at least 7 to 10 to typically 20 or more nucleotides that specifically hybridize to a sequence 

20 from mRNA encoding hTRT or mRNA transcribed from the hTRT gene. More often, the 
antisense polynucleotide of the invention is from about 10 to about 50 nucleotides in length 
or from about 14 to about 35 nucleotides in length. In other embodiments, antisense 
polynucleotides are polynucleotides of less than about 100 nucleotides or less than about 200 
nucleotides. In general, the antisense polynucleotide should be long enough to form a stable 

25 duplex but short enough, depending on the mode of delivery, to administer in vivo, if desired. 
The minimum length of a polynucleotide required for specific hybridization to a target 
sequence depends on several factors, such as G/C content, positioning of mismatched bases 
(if any), degree of uniqueness of the sequence as compared to the population of target 
polynucleotides, and chemical nature of the polynucleotide (e.g., methylphosphonate 

30 backbone, peptide nucleic acid, phosphorothioate), among other factors. 

Generally, to assure specific hybridization, the antisense sequence is 
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substantially complementary to the target hTRT mRNA sequence. In certain embodiments, 
the antisense sequence is exactly complementary to the target sequence. The antisense 
polynucleotides may also include, however, nucleotide substitutions, additions, deletions, 
transitions, transpositions, or modifications, or other nucleic acid sequences or non-nucleic 
acid moieties so long as specific binding to the relevant target sequence corresponding to 
hTRT RNA or its gene is retained as a functional property of the polynucleotide. 

In one embodiment, the antisense sequence is complementary to relatively 
accessible sequences of the hTRT mRNA (e.g., relatively devoid of secondary structure). 
This can be determined by analyzing predicted RNA secondary structures using, for example, 
the MFOLD program (Genetics Computer Group, Madison WI) and testing in vitro or in vivo 
as is known in the art. Examples of oligonucleotides that may be tested in cells for antisense 
suppression of hTRT function are those capable of hybridizing to (i.e., substantially 
complementary to) the following positions from SEQUENCE ID NO:l : 40-60; 260-280; 
500-520; 770-790; 885-905; 1000-1020 ; 1300-1320; 1520-1540; 2110-2130; 2295-2315; 
2450-2470; 2670-2690; 3080-31 10; 3140-3160; and 3690-3710. Another useful method for 
identifying effective antisense compositions uses combinatorial arrays of oligonucleotides 
(see, e.g., Milner et al., 1997, Nature Biotechnology 15:537). 

The invention also provides an antisense polynucleotide that has sequences in 
addition to the antisense sequence (i.e., in addition to anti-hTRT-sense sequence). In this 
case, the antisense sequence is contained within a polynucleotide of longer sequence. In 
another embodiment, the sequence of the polynucleotide consists essentially of, or is, the 
antisense sequence. 

The antisense nucleic acids (DNA, RNA, modified, analogues, and the like) 
can be made using any suitable method for producing a nucleic acid, such as the chemical 
synthesis and recombinant methods disclosed herein. In one embodiment, for example, 
antisense RNA molecules of the invention may be prepared by de novo chemical synthesis or 
by cloning. For example, an antisense RNA that hybridizes to hTRT mRNA can be made by 
inserting (ligating) an hTRT DNA sequence (e.g., SEQUENCE ID No; 1, or fragment 
thereof) in reverse orientation operably linked to a promoter in a vector (e.g., plasmid). 
Provided that the promoter and, preferably termination and polyadenylation signals, are 
properly positioned, the strand of the inserted sequence corresponding to the noncoding 
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strand will be transcribed and act as an antisense oligonucleotide of the invention. 

The antisense oligonucleotides of the invention can be used to inhibit 
telomerase activity in cell-free extracts, cells, and animals, including mammals and humans. 
For example, the phosphorothioate antisense oligonucleotides: 

A) S'-GGCATCGCGGGGGTGGCCGGG 

B) 5-CAGCGGGGAGCGCGCGGCATC 

C) S'-CAGCACCTCGCGGTAGTGGCT' 

D) 5'-GGACACCTGGCGGAAGGAGGG 

can be used to inhibit telomerase activity. At 10 micromolar concentration each 
oligonucleotide, mixtures of oligonucleotides A and B; A, B, C, and D; and A, C, and D 
inhibited telomerase activity in 293 cells when treated once per day for seven days. Inhibition 
was also observed when an antisense hTR molecule (5-GCTCTAGAATGAAGGGTG-3') 
was used in combination with oligonucleotides A, B, and C; A, B, and D; and A and C. 
Useful control oligonucleotides in such experiments include: 

51) 5-GCGACGACTGACATTGGCCGG 

52) 5 '-GGCTCGAAGTAGC ACCGGTGC 

53) 5'-GTGGGAACAGGCCGATGTCCC 

To determine the optimum antisense oligonucleotide of the invention for the 
particular application of interest, one can perform a scan using antisense oligonucleotide sets 
of the invention. One illustrative set is the set of 30-mer oligonucleotides that span the hTRT 
mRNA and are offset one from the next by fifteen nucleotides (i.e., ONI corresponds to 
positions 1-30 and is TCCCACGTGCGCAGCAGGACGCAGCGCTGC, ON2 corresponds 
to positions 16-45 and is GCCGGGGCCAGGGCTTCCCACGTGCGCAGC, and ON3 
corresponds to positions 31-60 and is GGCATCGCGGGGGTGGCCGGGGCCAGGGCT, 
and so on to the end of the mRNA). Each member of this set can be tested for inhibitory 
activity as disclosed herein. Those oligonucleotides that show inhibitory activity under the 
conditions of interest then identify a region of interest, and other oligonucleotides of the 
invention corresponding to the region of interest (i.e., 8-mers, 10-mers, 15-mers, and so on) 
can be tested to identify the oligonucleotide with the preferred activity for the application. 
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Exemplary antisense oligonucleotides include 5 ! -GGCATCGCGGGGGTG 
GCCGGGGCCAGGGCT-3' (corresponding to nucleotide positions 31-60 of hTRT); 5 f - 
GCGCA GCGTGCCAGCAGGTGAACCAGCACG-3' (corresponding to positions 496-525); 
5'- GCCCGTTCGCATCCCAGACGCCTTCGGGGT-3 , (corresponding to positions 631- 
660); and S'-ACGCTATGGTTCCAGGCCCGTTCGCATCCC-S 1 (corresponding to positions 
646-675). When ACHN cells (NCI #503755) or 293 cells were treated for three days with 10 
jjM of phosphorothioate oligonucleotides with any of the four sequences supra, inhibition of 
telomerase activity by about 50%-90% (compared to control untreated cells) as measured by a 
TRAP assay, was observed. 

For general methods relating to antisense polynucleotides, see Antisense 
RNA AND DNA, (1988), D.A. Melton, Ed., Cold Spring Harbor Laboratory, Cold Spring 
Harbor, NY). Seealso, Dagle etal., 1991, Nucleic Acids Research, 19:1805. Forareviewof 
antisense therapy, see, e.g., Uhlmann et aL, Chem. Reviews, 90:543-584 (1990). 

b) TRIPLEX OLIGO- AND POLYNUCLEOTIDES 

The present invention provides oligo- and polynucleotides (e.g., DNA, RNA, 
PNA or the like) that bind to double-stranded or duplex hTRT nucleic acids (e.g., in a folded 
region of the hTRT RNA or in the hTRT gene), forming a triple helix-containing, or "triplex" 
nucleic acid. Triple helix formation results in inhibition of hTRT expression by, for example, 
preventing transcription of the hTRT gene, thus reducing or eliminating telomerase activity in 
a cell. Without intending to be bound by any particular mechanism, it is believed that triple 
helix pairing compromises the ability of the double helix to open sufficiently for the binding 
of polymerases, transcription factors, or regulatory molecules to occur. 

Triplex oligo- and polynucleotides of the invention are constructed using the 
base-pairing rules of triple helix formation (see, e.g., Cheng et aL, 1988, J. Biol. Chem. 263: 
151 10; Ferrin and Camerini-Otero, 1991, Science 354:1494; Ramdas et aL, 1989, J. Biol 
Chem. 264:17395; Strobel et aL, 1991, Science 254:1639; and Rigas et aL, 1986, Proc. Natl 
Acad Set U.S.A. 83: 9591 ; each of which is incorporated herein by reference) and the hTRT 
mRNA and/or gene sequence. Typically, the triplex-forming oligonucleotides of the 
invention comprise a specific sequence of from about 10 to at least about 25 nucleotides or 
longer "complementary" to a specific sequence in the hTRT RNA or gene (i.e., large enough 
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to form a stable triple helix, but small enough, depending on the mode of delivery, to 
administer in vivo, if desired). In this context, "complementary" means able to form a stable 
triple helix. In one embodiment, oligonucleotides are designed to bind specifically to the 
regulatory regions of the hTRT gene (e.g., the hTRT 5 T -flanking sequence, promoters, and 
enhancers) or to the transcription initiation site, (e.g., between -10 and +10 from the 
transcription initiation site). For a review of recent therapeutic advances using triplex DNA, 
see Gee et aL, in Huber and Carr, 1994, Molecular and Immunologic Approaches, Futura 
Publishing Co, Mt Kisco NY and Rininsland et aL, 1997, Proc. Natl Acad. ScL USA 
94:5854, which are both incorporated herein by reference. 

c) RIBOZYMES 

The present invention also provides ribozymes useful for inhibition of 
telomerase activity. The ribozymes of the invention bind and specifically cleave and 
inactivate hTRT mRNA. Useful ribozymes can comprise 5 1 - and 3 '-terminal sequences 
complementary to the hTRT mRNA and can be engineered by one of skill on the basis of the 
hTRT mRNA sequence disclosed herein (see PCT publication WO 93/23572, supra). 
Ribozymes of the invention include those having characteristics of group I intron ribozymes 
(Cech, 1995, Biotechnology 13:323) and others of hammerhead ribozymes (Edgington, 1992, 
Biotechnology 10:256). 

Ribozymes of the invention include those having cleavage sites such as GUA, 
GUU and GUC. Other optimum cleavage sites for ribozyme-mediated inhibition of 
telomerase activity in accordance with the present invention include those described in PCT 
publications WO 94/02595 and WO 93/23569, both incorporated herein by reference. Short 
RNA oligonucleotides between 15 and 20 ribonucleotides in length corresponding to the 
region of the target hTRT gene containing the cleavage site can be evaluated for secondary 
structural features that may render the oligonucleotide more desirable. The suitability of 
cleavage sites may also be evaluated by testing accessibility to hybridization with 
complementary oligonucleotides using ribonuclease protection assays, or by testing for in 
vitro ribozyme activity in accordance with standard procedures known in the art. 

As described by Hu et aL, PCT publication WO 94/03596, incorporated herein 
by reference, antisense and ribozyme functions can be combined in a single oligonucleotide. 
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Moreover, ribozymes can comprise one or more modified nucleotides or modified linkages 
between nucleotides, as described above in conjunction with the description of illustrative 
antisense oligonucleotides of the invention. 

In one embodiment, the ribozymes of the invention are generated in vitro and 
5 introduced into a cell or patient. In another embodiment, gene therapy methods are used for 
expression of ribozymes in a target cell ex vivo or in vivo. 

d) ADMINISTRATION OF OLIGONUCLEOTIDES 

Typically, the therapeutic methods of the invention involve the administration 
10 of an oligonucleotide that functions to inhibit or stimulate telomerase activity under in vivo 
physiological conditions, and is relatively stable under those conditions for a period of time 
sufficient for a therapeutic effect. As noted above, modified nucleic acids may be useful in 
imparting such stability, as well as for targeting delivery of the oligonucleotide to the desired 
tissue, organ, or cell. 

1 5 Oligo- and poly-nucleotides can be delivered directly as a drug in a suitable 

pharmaceutical formulation, or indirectly by means of introducing a nucleic acid into a cell, 
including liposomes, immunoliposomes, ballistics, direct uptake into cells, and the like as 
described herein. For treatment of disease, the oligonucleotides of the invention will be 
administered to a patient in a therapeutically effective amount. A therapeutically effective 

20 amount is an amount sufficient to ameliorate the symptoms of the disease or modulate 

telomerase activity in the target cell, e.g., as can be measured using a TRAP assay or other 
suitable assay of telomerase biological function. Methods useful for delivery of 
oligonucleotides for therapeutic purposes are described in U.S. Patent 5,272,065, 
incorporated herein by reference. Other details of administration of pharmaceutically active 

25 compounds are provided below. In another embodiment, oligo- and poly-nucleotides can be 
delivered using gene therapy and recombinant DNA expression plasmids of the invention. 



3) GENE THERAPY 

Gene therapy refers to the introduction of an otherwise exogenous 
30 polynucleotide which produces a medically useful phenotypic effect upon the (typically) 

mammalian cell(s) into which it is transferred. In one aspect, the present invention provides 
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gene therapy methods and compositions for treatment of telomerase-associated conditions. In 
illustrative embodiments, gene therapy involves introducing into a cell a vector that expresses 
an hTRT gene product (such as an hTRT protein substantially similar to the hTRT 
polypeptide having a sequence of SEQUENCE ID NO: 2, e.g., to increase telomerase 
5 activity, or an inhibitory hTRT polypeptide to reduce activity), expresses a nucleic acid 
having an hTRT gene or mRNA sequence (such as an antisense RNA, e.g., to reduce 
telomerase activity), expresses a polypeptide or polynucleotide that otherwise affects 
expression of hTRT gene products (e.g., a ribozyme directed to hTRT mRNA to reduce 
telomerase activity), or replaces or disrupts an endogenous hTRT sequence (e.g., gene 
1 0 replacement and "gene knockout," respectively). Numerous other embodiments will be 
evident to one of skill upon review of the disclosure herein. In one embodiment, a vector 
encoding hTR is also introduced. In another embodiment, vectors encoding telomerase- 
associated proteins are also introduced with or without a vector for hTR. 

Vectors useful in hTRT gene therapy can be viral or nonviral, and include 
1 5 those described supra in relation to the hTRT expression systems of the invention. It will be 
understood by those of skill in the art that gene therapy vectors may comprise promoters and 
other regulatory or processing sequences, such as are described in this disclosure. Usually the 
vector will comprise a promoter and, optionally, an enhancer (separate from any contained 
within the promoter sequences) that serve to drive transcription of an oligoribonucleotide, as 
20 well as other regulatory elements that provide for episomal maintenance or chromosomal 

integration and for high-level transcription, if desired. A plasmid useful for gene therapy can 
comprise other functional elements, such as selectable markers, identification regions, and 
other sequences. The additional sequences can have roles in conferring stability both outside 
and within a cell, targeting delivery of hTRT nucleotide sequences (sense or antisense) to a 
25 specified organ, tissue, or cell population, mediating entry into a cell, mediating entry into the 
nucleus of a cell and/or mediating integration within nuclear DNA. For example, 
aptamer-like DNA structures, or other protein binding moieties sites can be used to mediate 
binding of a vector to cell surface receptors or to serum proteins that bind to a receptor 
thereby increasing the efficiency of DNA transfer into the cell. Other DNA sites and 
30 structures can directly or indirectly bind to receptors in the nuclear membrane or to other 

proteins that go into the nucleus, thereby facilitating nuclear uptake of a vector. Other DNA 
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sequences can directly or indirectly affect the efficiency of integration. 

Suitable gene therapy vectors may, or may not, have an origin of replication. 
For example, it is useful to include an origin of replication in a vector for propagation of the 
vector prior to administration to a patient. However, the origin of replication can often be 
5 removed before administration if the vector is designed to integrate into host chromosomal 
DNA or bind to host mRNA or DNA. In some situations (e.g., tumor cells) it may not be 
necessary for the exogenous DNA to integrate stably into the transduced cell, because 
transient expression may suffice to kill the tumor cells. 

As noted, the present invention also provides methods and reagents for gene 

1 0 replacement therapy (i.e., replacement by homologous recombination of an endogenous 
hTRT gene with a recombinant gene). Vectors specifically designed for integration by 
homologous recombination may be used. Important factors for optimizing homologous 
recombination include the degree of sequence identity and length of homology to 
chromosomal sequences. The specific sequence mediating homologous recombination is also 

1 5 important, because integration occurs much more easily in transcriptionally active DNA. 
Methods and materials for constructing homologous targeting constructs are described by 
e.g., Mansour et al., 1988, Nature 336: 348; Bradley et al, 1992, Bio/Technology 10: 534. 
See also, U.S. Patent Nos. 5,627,059; 5,487,992; 5,631,153; and 5,464,764. In one 
embodiment, gene replacement therapy involves altering or replacing all or a portion of the 

20 regulatory sequences controlling expression of the hTRT gene that is to be regulated. For 
example, the hTRT promoter sequences (e.g., such as are found in SEQUENCE ID NO: 6) 
may be disrupted (to decrease hTRT expression or to abolish a transcriptional control site) or 
an exogenous promoter (e.g., to increase hTRT expression) substituted. 

The invention also provides methods and reagents for hTRT "gene knockout" 

25 (i.e., deletion or disruption by homologous recombination of an endogenous hTRT gene using 
a recombinantly produced vector). In gene knockout, the targeted sequences can be 
regulatory sequences (e.g., the hTRT promoter), or RNA or protein coding sequences. The 
use of homologous recombination to alter expression of endogenous genes is described in 
detail in U.S. Patent No. 5,272,071 (and the U.S. Patents cited supra), WO 91/09955, WO 

30 93/09222, WO 96/2941 1, WO 95/31560, and WO 91/12650. See also, Moynahan et al., 
1996, Hum. Mol Genet. 5:875. 
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The invention further provides methods for specifically killing telomerase- 
positive cells, or preventing transformation of telomerase negative cells to a telomeratse 
positive state, using the hTRT gene promoter to regulate expression of a protein toxic to the 
cell. As shown in Example 14, an hTRT promoter sequence may be operably linked to a 
5 reporter gene such that activation of the promoter results in expression of the protein encoded 
by the reporter gene. If, instead of a reporter protein, the encoded protein is toxic to the cell, 
activation of the promoter leads to cell morbidity or death. In one embodiment of the present 
invention, a vector comprising an hTRT promoter operably linked to a gene encoding a toxic 
protein is introduced into cells, such as human cells, e.g., cells in a human patient, resulting in 
10 cell death of cells in which hTRT promoter activating factors are expressed, such as cancer 

cells. In a related embodiment, the encoded protein is not itself toxic to a cell, but encodes an 
activity that renders the cell sensitive to an otherwise nontoxic drug. For example, tumors 
can be treated by introducing an hTRT-promoter-Herpes thymidine kinase (TK) gene fusion 
construct into tumor cells, and administering gancyclovir or the equivalent (see, e.g., Moolton 
15 and Wells, 1990, J. Nat'l Cane. Inst. 82:297). The art knows of numerous other suitable toxic 
or potentially toxic proteins and systems (using promoter sequences other that hTRT) that 
may be modified and applied in accordance with the present invention by one of skill in the 
art upon review of this disclosure. 

Gene therapy vectors may be introduced into cells or tissues in vivo, in vitro or 
20 ex vivo. For ex vivo therapy, vectors may be introduced into cells, e.g., stem cells, taken from 
the patient and clonally propagated for autologous transplant back into the same patient (see, 
e.g., U.S. Patent Nos. 5,399,493 and 5,437,994, the disclosures of which are herein 
incorporated by reference). Cells that can be targeted for hTRT gene therapy aimed at 
increasing the telomerase activity of a target cell include, but are not limited to, embryonic 
25 stem or germ cells, particularly primate or human cells, as noted supra, hematopoietic stem 
cells (AIDS and post-chemotherapy), vascular endothelial cells (cardiac and cerebral vascular 
disease), skin fibroblasts and basal skin keratinocytes (wound healing and burns), 
chondrocytes (arthritis), brain astrocytes and microglial cells (Alzheimer's Disease), 
osteoblasts (osteoporosis), retinal cells (eye diseases), and pancreatic islet cells (Type I 
30 diabetes) and any of the cells listed in Table 3, infra, as well as any other cell types known to 
divide. 
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In one embodiment of the invention, an inducible promoter operably linked to 
a TRT, such as hTRT, coding sequence (or variant) is used to modulate the proliferative 
capacity of cells in vivo or in vitro. In a particular embodiment, for example, insulin- 
producing pancreatic cells transfected with an hTRT expression vector under the control of an 
inducible promoter are introduced into a patient. The proliferative capacity of the cells can 
then be controlled by administration to the patient of the promoter activating agent (e.g., 
tetracycline) to enable the cells to multiply more than otherwise would have been possible. 
Cell proliferation can then be terminated, continued, or reinitiated as desired by the treating 
physician. 

4) VACCINES AND ANTIBODIES 

Immuogenic peptides or polypeptides having an hTRT sequence can be used 
to elicit an anti-hTRT immune response in a patient (i.e., act as a vaccine). Exemplary 
immunogenic hTRT peptides and polypeptides are described infra in Examples 6 and 8. An 
immune response can also be raised by delivery of plasmid vectors encoding the polypeptide 
of interest (i.e., administration of "naked DNA"). The nucleic acids of interest can be 
delivered by injection, liposomes, or other means of administration. In one embodiment, 
immunization modes that elicit in the subject a Class I MHC restricted cytotoxic lymphocyte 
response against telomerase expressing cells are chosen. Once immunized, the individual or 
animal will elicit a heightened immune response against cells expressing high levels of 
telomerase (e.g., malignant cells). 

Anti-hTRT antibodies, e.g., murine, human, or humanized monoclonal 
antibodies may also be administered to a patient (e.g., passive immunization) to effect an 
immune response against telomerase-expressing cells. 

F) PHARMACEUTICAL COMPOSITIONS 

In related aspects, the invention provides pharmaceutical compositions that 
comprise hTRT oligo- and poly-nucleotides, polypeptides, and antibodies, agonists, 
antagonists, or inhibitors, alone or in combination with at least one other agent, such as a 
stabilizing compound, diluent, carrier, or another active ingredient or agent. 

The therapeutic agents of the invention may be administered in any sterile, 
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biocompatible pharmaceutical carrier, including, but not limited to, saline, buffered saline, 
dextrose, and water. Any of these molecules can be administered to a patient alone, or in 
combination with other agents, drugs or hormones, in pharmaceutical compositions where it 
is mixed with suitable excipient(s), adjuvants, and/or pharmaceutical^ acceptable carriers. In 
one embodiment of the present invention, the pharmaceutically acceptable carrier is 
pharmaceutically inert. 

Administration of pharmaceutical compositions is accomplished orally or 
parenterally. Methods of parenteral delivery include topical, intra-arterial (e.g., directly to the 
tumor), intramuscular, subcutaneous, intramedullary, intrathecal, intraventricular, 
intravenous, intraperitoneal, or intranasal administration. In addition to the active 
ingredients, these pharmaceutical compositions may contain suitable pharmaceutically 
acceptable carriers comprising excipients and other compounds that facilitate processing of 
the active compounds into preparations which can be used pharmaceutically. Further details 
on techniques for formulation and administration may be found in the latest edition of 
"Remington's Pharmaceutical Sciences" (Maack Publishing Co, Easton PA). 

Pharmaceutical compositions for oral administration can be formulated using 
pharmaceutically acceptable carriers well known in the art in dosages suitable for oral 
administration. Such carriers enable the pharmaceutical compositions to be formulated as 
tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, etc., suitable for 
ingestion by the patient. See PCT publication WO 93/23572. 

Pharmaceutical preparations for oral use can be obtained through combination 
of active compounds with solid excipient, optionally grinding a resulting mixture, and 
processing the mixture of granules, after adding suitable additional compounds, if desired, to 
obtain tablets or dragee cores. Suitable excipients are carbohydrate or protein fillers include, 
but are not limited to sugars, including lactose, sucrose, mannitol, or sorbitol; starch from 
corn, wheat, rice, potato, or other plants; cellulose such as methyl cellulose, 
hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose; and gums including 
arabic and tragacanth; as well as proteins such as gelatin and collagen. If desired, 
disintegrating or solubilizing agents may be added, such as the cross-linked polyvinyl 
pyrrolidone, agar, alginic acid, or a salt thereof, such as sodium alginate. 

Dragee cores are provided with suitable coatings such as concentrated sugar 
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solutions, which may also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel, 
polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents 
or solvent mixtures. Dyestuffs or pigments may be added to the tablets or dragee coatings for 
product identification or to characterize the quantity of active compound (i.e., dosage). 
5 Pharmaceutical preparations which can be used orally include push-fit 

capsules made of gelatin, as well as soft, sealed capsules made of gelatin and a coating such 
as glycerol or sorbitol. Push-fit capsules can contain active ingredients mixed with a filler or 
binders such as lactose or starches, lubricants such as talc or magnesium stearate, and, 
optionally, stabilizers. In soft capsules, the active compounds may be dissolved or suspended 
1 0 in suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycol with or 
without stabilizers. 

Pharmaceutical formulations for parenteral administration include aqueous 
solutions of active compounds. For injection, the pharmaceutical compositions of the 
invention may be formulated in aqueous solutions, preferably in physiologically compatible 
15 buffers such as Hank's solution, Ringer's solution, or physiologically buffered saline. 

Aqueous injection suspensions may contain substances which increase the viscosity of the 
suspension, such as sodium carboxymethyl cellulose, sorbitol, or dextran. Additionally, 
suspensions of the active compounds may be prepared as appropriate oily injection 
suspensions. Suitable lipophilic solvents or vehicles include fatty oils such as sesame oil, or 
20 synthetic fatty acid esters, such as ethyl oleate or triglycerides, or liposomes. Optionally, the 
suspension may also contain suitable stabilizers or agents which increase the solubility of the 
compounds to allow for the preparation of highly concentrated solutions. 

For topical or nasal administration, penetrants appropriate to the particular 
barrier to be permeated are used in the formulation. Such penetrants are generally known in 
25 the art. 

The pharmaceutical compositions of the present invention may be 
manufactured in a manner similar to that known in the art (e.g., by means of conventional 
mixing, dissolving, granulating, dragee-making, levigating, emulsifying, encapsulating, 
entrapping or lyophilizing processes). 
30 The pharmaceutical composition may be provided as a salt and can be formed 

with many acids, including but not limited to hydrochloric, sulfuric, acetic, lactic, tartaric, 
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malic, succinic, etc. Salts tend to be more soluble in aqueous or other protonic solvents that 
are the corresponding free base forms. In other cases, the preferred preparation may be a 
lyophilized powder in 1 mM-50 mM histidine, 0.1%-2% sucrose, 2%-7% mannitol at a pH 
range of 4.5 to 5.5, that is combined with buffer prior to use. 

After pharmaceutical compositions comprising a compound of the invention 
formulated in a acceptable carrier have been prepared, they can be placed in an appropriate 
container and labeled for treatment of an indicated condition. For administration of human 
telomerase proteins and nucleic acids, such labeling would include amount, frequency and 
method of administration. 

Pharmaceutical compositions suitable for use in the present invention include 
compositions wherein the active ingredients are contained in an effective amount to achieve 
the intended purpose. "Therapeutically effective amount" or "pharmacologically effective 
amount" are well recognized phrases and refer to that amount of an agent effective to produce 
the intended pharmacological result. Thus, a therapeutically effective amount is an amount 
sufficient to ameliorate the symptoms of the disease being treated. One useful assay in 
ascertaining an effective amount for a given application (e.g., a therapeutically effective 
amount) is measuring the effect on telomerase activity in a target cell. The amount actually 
administered will be dependent upon the individual to which treatment is to be applied, and 
will preferably be an optimized amount such that the desired effect is achieved without 
significant side-effects. The determination of a therapeutically effective dose is well within 
the capability of those skilled in the art. 

For any compound, the therapeutically effective dose can be estimated initially 
either in cell culture assays or in any appropriate animal model. The animal model is also 
used to achieve a desirable concentration range and route of administration. Such 
information can then be used to determine useful doses and routes for administration in 
humans. 

A therapeutically effective amount refers to that amount of protein, 
polypeptide, peptide, antibody, oligo- or polynucleotide, agonist or antagonists which 
ameliorates the symptoms or condition. Therapeutic efficacy and toxicity of such compounds 
can be determined by standard pharmaceutical procedures in cell cultures or experimental 
animals {e.g., ED 50 , the dose therapeutically effective in 50% of the population; and LD 50 , the 
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dose lethal to 50% of the population). The dose ratio between therapeutic and toxic effects is 
the therapeutic index, and it can be expressed as the ratio, ED 50 /LD 50 . Pharmaceutical 
compositions which exhibit large therapeutic indices are preferred. The data obtained from 
cell culture assays and animal studies is used in formulating a range of dosage for human use. 
5 The dosage of such compounds lies preferably within a range of circulating concentrations 
that include the ED 50 with little or no toxicity. The dosage varies within this range depending 
upon the dosage form employed, sensitivity of the patient, and the route of administration. 

The exact dosage is chosen by the individual physician in view of the patient 
to be treated. Dosage and administration are adjusted to provide sufficient levels of the active 
1 0 moiety or to maintain the desired effect. Additional factors which may be taken into account 
include the severity of the disease state (e.g. , tumor size and location; age, weight and gender 
of the patient; diet, time and frequency of administration, drug combination(s), reaction 
sensitivities, and tolerance/response to therapy). Long acting pharmaceutical compositions 
might be administered every 3 to 4 days, every week, or once every two weeks depending on 
1 5 half-life and clearance rate of the particular formulation. Guidance as to particular dosages 
and methods of delivery is provided in the literature (see, US Patent Nos. 4,657,760; 
5,206,344; and 5,225,212, herein incorporated by reference). Those skilled in the art will 
typically employ different formulations for nucleotides than for proteins or their inhibitors. 
Similarly, delivery of polynucleotides or polypeptides can be specific to particular cells, 
20 conditions, locations, and the like. 

VIIL INCREASING PROLIFERATIVE CAPACITY AND PRODUCTION OF 
IMMORTALIZED CELLS, CELL LINES, AND ANIMALS 

As discussed above, most vertebrate cells senesce after a finite number of 
25 divisions in culture (e.g., 50 to 100 divisions). Certain variant cells, however, are able to 
divide indefinitely in culture (e.g., HeLa cells, 293 cells) and, for this reason, are useful for 
research and industrial applications. Usually these immortal cell lines are derived from 
spontaneously arising tumors, or by transformation by exposure to radiation or a tumor- 
inducing virus or chemical. Unfortunately, a limited selection of cell lines, especially human 
30 cell lines representing differentiated cell function, is available. Moreover, the immortal cell 
lines presently available are characterized by chromosomal abnormalities (e.g., aneuploidy, 
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gene rearrangements, or mutations). Further, many long-established cell lines are relatively 
undifferentiated (e.g., they do not produce highly specialized products of the sort that 
uniquely characterize particular tissues or organs). Thus, there is a need for new methods of 
generating immortal cells, especially human cells. As used herein, the term "immortalized 

5 cells" is not limited to cells that proliferate indefinitely, but may also include cells with 
increased proliferative capacity compared to similar wild-type cells. Depending on the cell 
type, increased proliferative capacity may mean proliferation for at least about 100, about 
150, about 200, or about 400 or more generations, or for at least about 6, about 12, about 1 8, 
about 24 or about 36 or more months in in vitro culture. One use for immortalized cells is in 

1 0 production of natural proteins and recombinant proteins (e.g., therapeutic polypeptides such 
as erythropoietin, human growth hormone, insulin, and the like), or antibodies, for which a 
stable, genetically normal cell line is preferred. For production of some recombinant 
proteins, specialized cell types may also be preferred (e.g., pancreatic cells for the production 
of human insulin). Another use for immortalized cells or even mortal cells with increased 

1 5 proliferative capacity (relative to unmodified cells) is for introduction into a patient for gene 
therapy, or for replacement of diseased or damaged cells or tissue. For example, autologous 
immune cells containing or expressing a, e.g., recombinant hTRT gene or polypeptide of the 
invention can be used for cell replacement in a patient after aggressive cancer therapy, e.g., 
whole body irradiation. Another use for immortalized cells is for ex vivo production of 

20 "artificial" tissues or organs (e.g., skin) for therapeutic use. Another use for such cells is for 
screening or validation of drugs, such as telomerase-inhibiting drugs, or for use in production 
of vaccines or biological reagents. Additional uses of the cells of the invention will be 
apparent to those of skill. 

The immortalized cells and cell lines, as well as those of merely increased 

25 replicative capacity, of the invention are made by increasing telomerase activity in the cell. 
Any method disclosed herein for increasing telomerase activity can be used. Thus, in one 
embodiment, cells are immortalized by increasing the amount of an hTRT polypeptide in the 
cell. In one embodiment, hTRT levels are increased by introducing an hTRT expression 
vector into the cell (with stable transfection sometimes preferred). As discussed above, the 

30 hTRT coding sequence is usually operably linked to a promoter, which may be inducible or 
constitutively active in the cell. 
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In one embodiment, a polynucleotide comprising a sequence encoding a 
polypeptide of SEQUENCE ID NO: 2, which sequence is operably linked to a promoter (e.g., 
a constitutively expressed promoter, e.g., a sequence of SEQUENCE ID NO: 6), is introduced 
into the cell. In one embodiment the polynucleotide comprises a sequence of SEQUENCE 
5 ID NO: 1 . Preferably the polynucleotide includes polyadenylation and termination signals. 
In other embodiments, additional elements such as enhancers or others discussed supra are 
included. In an alternative embodiment, the polynucleotide does not include a promoter 
sequence, such sequence being provided by the target cell endogenous genome following 
integration (e.g., recombination, e.g., homologous recombination) of the introduced 

1 0 polynucleotide. The polynucleotide may be introduced into the target cell by any method, 
including any method disclosed herein, such as lipofection, electroporation, virosomes, 
liposomes, immunoliposomes, polycatiomnucleic acid conjugates, naked DNA). 

Using the methods of the invention, any vertebrate cell can be caused to have 
an increased proliferative capacity or even be immortalized and sustained indefinitely in 

1 5 culture. In one embodiment the cells are mammalian, with human cells preferred for many 
applications. Examples of human cells that can be immortalized include those listed in Table 
3. 

It will be recognized that the "diagnostic" assays of the invention described 
infra may be used to identify and characterize the immortalized cells of the invention. 

20 

TABLE 3 

HUMAN CELLS IN WHICH hTRT EXPRESSION MAY BE INCREASED 
Keratinizing Epithelial Cells 

keratinocyte of epidermis (differentiating epidermal cell) 
25 basal cell of epidermis (stem cell) 

keratinocyte of fingernails and toenails 
basal cell of nail bed (stem cell) 
hair shaft cells 

medullary, cortical, cuticular; hair-root sheath cells, 
30 cuticular, of Huxley r s layer, of Henle 1 s layer external; 

hair matrix cell (stem cell) 

Cells of Wet Stratified Barrier Epithelia 

surface epithelial cell of stratified squamous epithelium of 
35 tongue, oral cavity, esophagus, anal canal, distal urethra, 
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vagina 

basal cell of these epithelia (stem cell) 
cell of external corneal epithelium 

cell of urinary epithelium (lining bladder and urinary ducts) 

5 

Epithelial Cells Specialized for Exocrine Secretion 

cells of salivary gland 

mucous cell (secretion rich in polysaccharide) 
serous cell (secretion rich in glycoprotein enzymes) cell 
10 of von Ebner's gland in tongue (secretion to wash over 

taste buds) 

cell of mammary gland, secreting milk 

cell of lacrimal gland, secreting tears 

cell of ceruminous gland of ear, secreting wax 
15 cell of eccrine sweat gland, secreting glycoproteins (dark 

cell) 

cell of eccrine sweat gland, secreting small molecules (clear 
cell) 

cell of apocrine sweat gland (odoriferous secretion, 

20 sex- hormone sensitive) 

cell of gland of Moll in eyelid (specialized sweat gland) 
cell of sebaceous gland, secreting lipid-rich sebum 
cell of Bowman 1 s gland in nose (secretion to wash over 
olfactory epithelium) 

25 cell of Brunner's gland in duodenum, secreting alkaline 
solution of mucus and enzymes 

cell of seminal vesicle, secreting components of seminal 
fluid, including fructose (as fuel for swimming sperm) 
cell of prostate gland, secreting other components of seminal 
30 fluid 

cell of bulbourethral gland, secreting mucus 

cell of Bartholin's gland, secreting vaginal lubricant 

cell of gland of Littre, secreting mucus 

cell of endometrium of uterus, secreting mainly carbohydrates 
35 isolated goblet cell of respiratory and digestive tracts, 
secreting mucus 

mucous cell of lining of stomach 

zymogenic cell of gastric gland, secreting pepsinogen 
oxyntic cell of gastric gland, secreting HC1 
40 acinar cell of pancreas, secreting digestive enzymes and 
bicarbonate 

Paneth cell of small intestine, secreting lysozyme 
type II pneumocyte of lung, secreting surfactant 
Clara cell of lung 

45 

Cells specialized for Secretion of Hormones 
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cells of anterior pituitary, secreting 

growth hormone, follicle-stimulating hormone, 
luteinizing hormone, prolactin, adrenocorticotropic 
hormone, and thyroid-stimulating hormone, 
5 cell of intermediate pituitary, secreting 
melanocyte -stimulating hormone 
cells of posterior pituitary, secreting 

oxytocin, vasopressin 
cells of gut, secreting 
10 serotonin, endorphin, somatostatin, gastrin, 

secretin, cholecystokinin, insulin and glucagon 
cells of thyroid gland, secreting 

thyroid hormone, calcitonin 
cells of parathyroid gland, secreting 
15 parathyroid hormone, oxyphil cell 

cells of adrenal gland, secreting 

epinephrine, norepinephrine, and steroid hormones; 
mineralocorticoids 
glucocorticoids 
20 cells of gonads, secreting 

testosterone (Leydig cell of testis) 

estrogen (theca interna cell of ovarian follicle) 

progesterone (corpus luteum cell of ruptured ovarian 

follicle) 

25 cells of juxtaglomerular apparatus of kidney 
juxtaglomerular cell (secreting renin) 
macula densa cell 
peripolar cell 
mesangial cell 

30 

Epithelial Absorptive Cells in Gut, Exocrine Glands, and 
Urogenital Tract 

brush border cell of intestine (with microvilli) 

striated duct cell of exocrine glands 
35 gall bladder epithelial cell 

brush border cell of proximal tubule of kidney 

distal tubule cell of kidney 

nonciliated cell of ductulus efferens 

epididymal principal cell 
40 epididymal basal cell 

Cells Specialized for Metabolism and Storage 

hepatocyte (liver cell) 
fat cells 
45 white fat 

brown fat 

122 



lipocyte of liver 

Epithelial Cells Serving Primarily a Barrier Function, Lining 
the Lung, Gut, Exocrine Glands, and Urogenital Tract 

5 type I pneumocyte (lining air space of lung) 
pancreatic duct cell (centroacinar cell) 

nonstriated duct cell of sweat gland, salivary gland, mammary 
gland 

parietal cell of kidney glomerulus 
10 podocyte of kidney glomerulus 

cell of thin segment of loop of Henle (in kidney) 

collecting duct cell (in kidney) 

duct cell of seminal vesicle, prostate gland 

15 Epithelial Cells Lining Closed Internal Body Cavities 

vascular endothelial cells of blood vessels and lymphatics 
fenestrated 
continuous 
splenic 

20 synovial cell (lining joint cavities, secreting largely 
hyaluronic acid) 
serosal cell (lining peritoneal, pleural, and pericardial 
cavities) 

squamous cell lining perilymphatic space of ear 
25 cells lining endolymphatic space of ear 
squamous cell 

columnar cells of endolymphatic sac 
with microvilli 
without microvilli 
30 "dark" cell 

vestibular membrane cell (resembling choroid plexus cell) 
stria vascularis basal cell 
stria vascularis marginal cell 
cell of Claudius 
35 cell of Boettcher 

choroid plexus cell (secreting cerebrospinal fluid) 
squamous cell of pia-arachnoid 
cells of ciliary epithelium of eye 
pigmented 
40 nonpigmented 

corneal "endothelial" cell 

Ciliated Cells with Propulsive Function 

of respiratory tract 
45 of oviduct and of endometrium of uterus (in female) 
of rete testis and ductulus efferens (in male) 
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of central nervous system (ependymal cell lining brain 
cavities) 

Cells Specialized for Secretion of Extracellular Matrix 

5 epithelial : 

ameloblast (secreting enamel of tooth) 

planum semilunatum cell of vestibular apparatus of ear 

(secreting proteoglycan) 
interdental cell of organ of Corti (secreting tectorial 
10 "membrane" covering hair cells of organ of Corti) 

nonepithelial (connective tissue) 

fibroblasts (various -of loose connective tissue, of cornea, 
of tendon, of reticular tissue of bone marrow, etc.) 
pericyte of blood capillary 
15 nucleus pulposus cell of intervertebral disc 

cementoblast/cementocyte (secreting bonelike cementum of 

root of tooth) 

odontoblast /odontocyte (secreting dentin of tooth) 
chondrocytes 

20 of hyaline cartilage, of f ibrocartilage, of elastic 

cartilage 
osteoblast /osteocyte 

osteoprogenitor cell (stem cell of osteoblasts) 
hyalocyte of vitreous body of eye 
25 stellate cell of perilymphatic space of ear 

Contractile Cells 

skeletal muscle cells 
red (slow) 
30 white (fast) 

intermediate 

muscle spindle — nuclear bag 
muscle spindle — nuclear chain 
satellite cell (stem cell) 
35 heart muscle cells 
ordinary 
nodal 

Purkin j e f iber 
smooth muscle cells 
40 myoepithelial cells 
of iris 

of exocrine glands 

Cells of Blood and Immune System 

45 red blood cell 
megakaryocyte 



124 



macrophages 
monocyte 

connective tissue macrophage (various) 
Langerhans cell (in epidermis) 
5 osteoclast (in bone) 

dendritic cell (in lymphoid tissues) 
microglial cell (in central nervous system) 
neutrophil 
eosinophil 
10 basophil 
mast cell 
T lymphocyte 
helper T cell 
suppressor T cell 
15 killer T cell 

B lymphocyte 
IgM 
igG 
IgA 

20 IgE 

killer cell 

stem cells for the blood and immune system (various) 

Sensory Transducers 

25 photoreceptors 
rod 
cones 

blue sensitive 
green sensitive 
30 red sensitive 

hearing 

inner hair cell of organ of Corti 

outer hair cell of organ of Corti 
acceleration and gravity 
35 type I hair cell of vestibular apparatus of ear 

type II hair cell of vestibular apparatus of ear 
taste 

type 11 taste bud cell 
smell 

40 olfactory neuron 

basal cell of olfactory epithelium (stem cell for olfactory 

neurons ) 
blood Ph 

carotid body cell 
45 type I 

type II 
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touch 

Merkel cell of epidermis 

primary sensory neurons specialized for touch temperature 
primary sensory neurons specialized for temperature 
5 cold sensitive 

heat sensitive 

pain 

primary sensory neurons specialized for pain configurations 
and forces in musculoskeletal system 
10 proprioceptive primary sensory neurons 

Autonomic Neurons 

cholinergic 
adrenergic 
15 peptidergic 

Supporting Cells of Sense Organs and of Peripheral Neurons 

supporting cells of organ of Corti 
inner pillar cell 
20 outer pillar cell 

inner phalangeal cell 
outer phalangeal cell 
border cell 
Hensen cell 

25 supporting cell of vestibular apparatus 

supporting cell of taste bud (type I taste bud cell) 
supporting cell of olfactory epithelium 
Schwann cell 

satellite cell (encapsulating peripheral nerve cell bodies) 
30 enteric glial cell 

Neurons and Glial Cells of Central Nervous System 

neurons 
glial cells 
35 astrocyte 

o 1 i go dendro cy t e 

Lens Cells 

anterior lens epithelial cell 
40 lens fiber (crystallin-containing cell) 
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Pigment Cells 

melanocyte , retinal pigmented epithelial cell 

Germ Cells 

5 oogonium/oocyte 
spermatocyte 

spermatogonium (stem cell for spermatocyte) 

Nurse Cells 

10 ovarian follicle cell 

Sertoli cell (in testis) 
thymus epithelial cell 

Stem Cells 

15 embryonic stem cell 
embryonic germ cell 
adult stem cell 
fetal stem cell 

20 

IX. DIAGNOSTIC ASSAYS 
A) INTRODUCTION 
1)TRT ASSAYS 

The present invention provides a wide variety of assays for TRT, preferably 
25 hTRT, and telomerase. These assays provide, inter alia, the basis for sensitive, inexpensive, 
convenient, and widely applicable assays for diagnosis and prognosis of a number of human 
diseases, of which cancer is an illustrative example- As noted supra, hTRT gene products 
(protein and mRNA) are usually elevated in immortal human cells relative to most normal 
mortal cells (i.e., telomerase-negative cells and most telomerase-positive normal adult 
30 somatic cells). Thus, in one aspect, the invention provides assays useful for detecting or 

measuring the presence, absence, or quantity of an hTRT gene product in a sample from, or 
containing, human or other mammalian or eukayotic cells to characterize the cells as 
immortal (such as a malignant tumor cell) or mortal (such as most normal somatic cells in 
adults) or as telomerase positive or negative. 
35 Any condition characterized by the presence or absence of an hTRT gene 

product (i.e., protein or RNA) may be diagnosed using the methods and materials described 
herein. These include, as described more fully below, cancers, other diseases of accelerated 

127 



I! 



I'll 1 " I! 1 1 11 III 



cell proliferation, immunological disorders, fertility, infertility, and others. Moreover, 
because the degree to which telomerase activity is elevated in cancer cells is correlated with 
characteristics of the tumor, such as metastatic potential, monitoring hTRT, mRNA or protein 
levels can be used to estimate and predict the likely future progression of a tumor. 

In one aspect, the diagnostic and prognostic methods of the invention entail 
determining whether a human TRT gene product is present in a biological sample (e.g., from 
a patient). In a second aspect, the abundance of hTRT gene product in a biological sample 
(e.g., from a patient) is determined and compared to the abundance in a control sample (e.g., 
normal cells or tissues). In a third aspect, the cellular or intracellular localization of an hTRT 
gene product is determined in a cell or tissue sample. In a fourth aspect, host (e.g., patient) 
cells are assayed to identify nucleic acids with sequences characteristic of a heritable 
propensity for abnormal hTRT gene expression (abnormal quantity, regulation, or product), 
such as is useful in genetic screening or genetic counseling. In a fifth aspect, the assays of the 
invention are used detect the presence of anti-hTRT antibodies (e.g., in patient serunci). The 
methods described below in some detail are indicative of useful assays that can be carried out 
using the sequences and relationships disclosed herein. However, numerous variations or 
other applications of these assays will be apparent to those of ordinary skill in the art in view 
of this disclosure. 

It will be recognized that, although the assays below are presented in terms of 
diagnostic and prognostic methods, they may be used whenever an hTRT gene, gene product, 
or variant is to be detected, quantified, or characterized. Thus, for example, the "diagnostic" 
methods described infra are useful for assays of hTRT or telomerase during production and 
purification of hTRT or human telomerase, for characterization of cell lines derived from 
human cells (e.g., to identify immortal lines), for characterization of cells, non-human 
animals, plants, fungi, bacteria or other organisms that comprise a human TRT gene or gene 
product (or fragments thereof). 

As used herein, the term "diagnostic" has its usual meaning of identifying the 
presence or nature of a disease (e.g., cancer), condition (e.g., infertile, activated), or status 
(e.g., fertile), and the term "prognostic" has its usual meaning of predicting the probable 
development and/or outcome of a disease or condition. Although these two terms aire used in 
somewhat different ways in a clinical setting, it will be understood that any of the assays or 
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assay formats disclosed below in reference to "diagnosis" are equally suitable for 
determination of prognosis because it is well established that higher telomerase activity levels 
are associated with poorer prognoses for cancer patients, and because the present invention 
provides detection methods specific for hTRT, which is expressed at levels that closely 
5 correlate with telomerase activity in a cell . 

2) DIAGNOSIS AND PROGNOSIS OF CANCER 

The determination of an hTRT gene, mRNA or protein level above normal or 
standard range is indicative of the presence of telomerase-positive cells, or immortal, of 

10 which certain tumor cells are examples. Because certain embryonic and fetal cells, as well as 
certain adult stem cells, express telomerase, the present invention also provides methods for 
determining other conditions, such as pregnancy, by the detection or isolation of telomerase 
positive fetal cells from maternal blood. These values can be used to make, or aid in making, 
a diagnosis, even when the cells would not have been classified as cancerous or otherwise 

1 5 detected or classified using traditional methods. Thus, the methods of the present invention 
permit detection or verification of cancerous or other conditions associated with telomerase 
with increased confidence, and at least in some instances at an earlier stage. The assays of the 
invention allow discrimination between different classes and grades of human tumors or other 
cell-proliferative diseases by providing quantitative assays for the hTRT gene and gene 

20 products and thereby facilitate the selection of appropriate treatment regimens and accurate 
diagnoses. Moreover, because levels of telomerase activity can be used to distinguish 
between benign and malignant tumors (e.g., U.S. Patent No. 5,489,508; Hiyama et aL, 1997, 
Proc. Am Ass. Cancer Res. 38:637), to predict immanence of invasion (e.g., U.S. Patent 
No. 5,639,613; Yashima et al., 1997, Proc. Am Ass. Cancer Res. 38:326), and to correlate 

25 with metastatic potential (e.g., U.S. Patent No. 5,648,21 5; Pandita et al, 1 996, Proc. Am Ass. 
Cancer Res. 37:559), these assays will be useful for prophylaxis, detection, and treatment of a 
wide variety of human cancers. 

For prognosis of cancers (or other diseases or conditions characterized by 
elevated telomerase), a prognostic value of hTRT gene product (mRNA or protein) or activity 

30 for a particular tumor type, class or grade, is determined as described infra. hTRT protein or 
mRNA levels or telomerase activity in a patient can also be determined (e.g., using the assays 
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disclosed herein) and compared to the prognostic level 

Depending on the assay used, in some cases the abundance of an hTRT gene 
product in a sample will be considered elevated whenever it is detectable by the assay. Due 
to the low abundance of hTRT mRNA and protein even in telomerase-positive cells, and the 
rarity or non-existence of these gene products in normal or telomerase-negative cells, 
sensitive assays are required to detect the hTRT gene product if present at all in normal cells. 
If less sensitive assays are selected, hTRT gene products will be undetectable in healthy 
tissue but will be detectable in telomerase-positive cancer or other telomerase-positive cells. 
Typically, the amount of hTRT gene product in an elevated sample is at least about five, 
frequently at least about ten, more often at least about 50, and very often at least about 100 to 
1000 times higher than the levels in telomerase-negative control cells or cells from healthy 
tissues in an adult, where the percentage of telomerase-positive normal cells is very low. 

The diagnostic and prognostic methods of the present invention can be 
employed with any cell or tissue type of any origin and can be used to detect an immortal or 
neoplastic cell, or tumor tissue, or cancer, of any origin. Types of cancer that may be 
detected include, but are not limited to, all those listed supra in the discussion of therapeutic 
applications of hTRT. 

The assays of the invention are also useful for monitoring the efficacy of 
therapeutic intervention in patients being treated with anticancer regimens. Anticancer 
regimens that can be monitored include all presently approved treatments (including 
chemotherapy, radiation therapy, and surgery) and also includes treatments to be approved in 
the future, such as telomerase inhibition or activation therapies as described herein. (See, 
e.g., See PCT Publication Nos. 96/01835 and 96/40868 and U.S. Patent No. 5,583,016; all of 
which are incorporated by reference in their entirety). 

In another aspect, the assays described below are useful for detecting certain 
variations in hTRT gene sequence (mutations and heritable hTRT alleles) that are indicative 
of a predilection for cancers or other conditions associated with abnormal regulation of 
telomerase activity (infertility, premature aging). 

3) DIAGNOSIS OF CONDITIONS OTHER THAN CANCER 

In addition to diagnosis of cancers, the assays of the present invention have 
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numerous other applications. The present invention provides reagents and methods/diagnosis 
of conditions or diseases characterized by under- or over-expression of telomerase or hTRT 
gene products in cells. In adults, a low level of telomerase activity is normally found in a 
limited complement of normal human somatic cells, e.g., stem cells, activated lymphocytes 
and germ cells, and is absent from other somatic cells. Thus, the detection of hTRT or 
telomerase activity in cells in which it is normally absent or inactive, or detection at abnormal 
(i.e., higher or lower than normal) levels in cells in which hTRT is normally present at a low 
level (such as stem cells, activated lymphocytes and germ cells), can be diagnostic of a 
telomerase-related disease or condition or used to identify or isolate a specific cell type (i.e., 
to isolate stem cells). Examples of such diseases and conditions include: diseases of cell 
proliferation, immunological disorders, infertility, diseases of immune cell function, 
pregnancy, fetal abnormalities, premature aging, and others. Moreover, the assays of the 
invention are useful for monitoring the effectiveness of therapeutic intervention (including 
but not limited to drugs that modulate telomerase activity) in a patient or in a cell- or animal- 
based assay. 

In one aspect, the invention provides assays useful for diagnosing infertility. 
Human germ cells (e.g., spermatogonia cells, their progenitors or descendants) are capable of 
indefinite proliferation and characterized by high telomerase activity. Abnormal levels or 
products or diminished levels of hTRT gene products can result in inadequate or abnormal 
production of spermatozoa, leading to infertility or disorders of reproduction. Accordingly, 
the invention provides assays (methods and reagents) for diagnosis and treatment of 
"telomerase-based" reproductive disorders. Similarly, the assays can be used to monitor the 
efficacy of contraceptives (e.g., male contraceptives) that target or indirectly affect sperm 
production (and which would reduce hTRT levels or telomerase activity). 

In another aspect, the invention provides assays for analysis of telomerase and 
hTRT levels and function in stem cells, fetal cells, embryonic cells, activated lymphocytes 
and hematopoietic stem cells. For example, assays for hTRT gene product detection can be 
used to monitor immune function generally (e.g., by monitoring the prevalence of activated 
lymphocytes or abundance of progenitor stem cells), to identify or select or isolate activated 
lymphocytes or stem cells (based on elevated hTRT levels), and to monitor the efficacy of 
therapeutic interventions targeting these tissues (e.g., immunosuppressive agents or 
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therapeutic attempt to expand a stem cell population). 

The invention also provides assays useful for identification of anti-telomerase 
and anti-TRT immunoglobulins (found in serum from a patient). The materials and assays 
described herein can be used to identify patients in which such autoimmune antibodies are 
found, permitting diagnosis and treatment of the condition associated with the 
immunoglobulins. 

4) MONITORING CELLS IN CULTURE 

The assays described herein are also useful for monitoring the expression of 
hTRT gene products and characterization of hTRT genes in cells ex vivo or in vitro. Because 
elevated hTRT levels are characteristic of immortalized cells, the assays of the invention can 
be used, for example, to screen for, or identify, immortalized cells or to identify an agent 
capable of mortalizing immortalized cells by inhibiting hTRT expression or function. For 
example, the assay will be useful for identifying cells immortalized by increased expression 
of hTRT in the cell, e.g., by the expression of a recombinant hTRT or by increased expression 
of an endogenously coded hTRT (e.g., by promoter activation). 

Similarly, these assays may be used to monitor hTRT expression in transgenic 
animals or cells (e.g., yeast or human cells containing an hTRT gene). In particular, the 
effects of certain treatments (e.g., application of known or putative telomerase antagonists) on 
the hTRT levels in human and nonhuman cells expressing the hTRT of the invention can be 
used for identifying useful drugs and drug candidates (e.g., telomerase activity-modulating 
drugs). 

B) NORMAL, DIAGNOSTIC, AND PROGNOSTIC VALUES 

Assays for the presence or quantity of hTRT gene products may be carried out 
and the results interpreted in a variety of ways, depending on the assay format, the nature of 
the sample being assayed, and the information sought. For example, the steady state 
abundance of hTRT gene products is so low in most human somatic tissues that they are 
undetectable by certain assays. Moreover, there is generally no telomerase activity in the 
cells of these tissues, making verification of activity quite easy. Conversely, hTRT protein 
and/or hTRT mRNA or telomerase is sufficiently abundant in other telomerase-positive 
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tissues, e.g., malignant tumors, so that the same can be detected using the same assays. Even 
in those somatic cell types in which low levels of telomerase activity can normally be 
detected (e.g., stem cells, and certain activated hematopoietic system cells), the levels of 
hTRT mRNA and telomerase activity are a small fraction (e.g., estimated at about 1% or less) 
5 of the levels in immortal cells; thus, immortal and mortal cells may be easily distinguished by 
the methods of the present invention. It will be appreciated that, when a "less sensitive" 
assay is used, the mere detection of the hTRT gene product in a biological sample can itself 
be diagnostic, without the requirement for additional analysis. Moreover, although the assays 
described below can be made exquisitely sensitive, they may also, if desired, be made less 
10 sensitive (e.g., through judicious choice of buffers, wash conditions, numbers of rounds of 
amplification, reagents, and/or choice of signal amplifiers). Thus, virtually any assay can be 
designed so that it detects hTRT gene products only in biological samples in which they are 
present at a particular concentration, e.g. a higher concentration than in healthy or other 
control tissue. In this case, any detectable level of hTRT mRNA or protein will be considered 
1 5 elevated in cells from post-natal human somatic tissue (other than hematopoietic cells and 
other stem cells). 

In some cases, however, it will be desirable to establish normal or baseline 
values (or ranges) for hTRT gene product expression levels, particularly when very sensitive 
assays capable of detecting very low levels of hTRT gene products that may be present in 

20 normal somatic cells are used. Normal levels of expression or normal expression products 
can be determined for any particular population, subpopulation, or group of organisms 
according to standard methods well known to those of skill in the art and employing the 
methods and reagents of the invention. Generally, baseline (normal) levels of hTRT protein 
or hTRT mRNA are determined by quantitating the amount of hTRT protein and/or mRNA in 

25 biological samples (e.g., fluids, cells or tissues) obtained from normal (healthy) subjects, e.g., 
a human subject. For certain samples and purposes, one may desire to quantitate the amount 
of hTRT gene product on a per cell, or per tumor cell, basis. To determine the cellularity of a 
sample, one may measure the level of a constitutively expressed gene product or other gene 
product expressed at known levels in cells of the type from which the sample was taken. 

30 Alternatively, normal values of hTRT protein or hTRT mRNA can be determined by 

quantitating the amount of hTRT protein/RNA in cells or tissues known to be healthy, which 
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are obtained from the same patient from whom diseased (or possibly diseased) cells aire 
collected or from a healthy individual. Alternatively, baseline levels can be defined in some 
cases as the level present in non-immortal human somatic cells in culture. It is possible that 
normal (baseline) values may differ somewhat between different cell types (for example, 
5 hTRT mRNA levels will be higher in testis than kidney), or according to the age, sex, or 
physical condition of a patient. Thus, for example, when an assay is used to determine 
changes in hTRT levels associated with cancer, the cells used to determine the normal range 
of hTRT gene product expression can be cells from persons of the same or a different age, 
depending on the nature of the inquiry. Application of standard statistical methods used in 

10 molecular genetics permits determination of baseline levels of expression, as well as permits 
identification of significant deviations from such baseline levels. 

In carrying out the diagnostic and prognostic methods of the invention, as 
described above, it will sometimes be useful to refer to "diagnostic" and "prognostic values." 
As used herein, "diagnostic value" refers to a value that is determined for the hTRT gene 

1 5 product detected in a sample which, when compared to a normal (or "baseline") range of the 
hTRT gene product is indicative of the presence of a disease. The disease may be 
characterized by high telomerase activity (e.g., cancer), the absence of telomerase activity 
(e.g., infertility), or some intermediate value. "Prognostic value" refers to an amount of the 
hTRT gene product detected in a given cell type (e.g., malignant tumor cell) that is consistent 

20 with a particular diagnosis and prognosis for the disease (e.g., cancer). The amount 

(including a zero amount) of the hTRT gene product detected in a sample is compared to the 
prognostic value for the cell such that the relative comparison of the values indicates the 
presence of disease or the likely outcome of the disease (e.g., cancer) progression. In one 
embodiment, for example, to assess tumor prognosis, data are collected to obtain a 

25 statistically significant correlation of hTRT levels with different tumor classes or grades. A 
predetermined range of hTRT levels is established for the same cell or tissue sample obtained 
from subjects having known clinical outcomes. A sufficient number of measurements is 
made to produce a statistically significant value (or range of values) to which a comparison 
will be made. The predetermined range of hTRT levels or activity for a given cell or tissue 

30 sample can then be used to determine a value or range for the level of hTRT gene product that 
would correlate to favorable (or less unfavorable) prognosis (e.g., a "low level" in the case of 
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cancer). A range corresponding to a "high level" correlated to an (or a more) unfavorable 
prognosis in the case of cancer can similarly be determined. The level of hTRT gene product 
from a biological sample (e.g., a patient sample) can then be determined and compared to the 
low and high ranges and used to predict a clinical outcome. 
5 Although the discussion above refers to cancer for illustration, it will be 

understood that diagnostic and prognostic values can also be determined for other diseases 
(e.g., diseases of cell proliferation) and conditions and that, for diseases or conditions other 
than cancer, a "high" level may be correlated with the desired outcome and a "low" level 
correlated with an unfavorable outcome. For example, some diseases may be characterized 

10 by a deficiency (e.g., low level) of telomerase activity in stem cells, activated lymphocytes, or 
germline cells. In such cases, "high" levels of hTRT gene products relative to cells of similar 
age and/or type (e.g., from other patients or other tissues in a particular patient) may be 
correlated with a favorable outcome. 

It will be appreciated that the assay methods do not necessarily require 

15 measurement of absolute values of hTRT, unless it is so desired, because relative values are 
sufficient for many applications of the methods of the present invention. Where quantitation 
is desirable, the present invention provides reagents such that virtually any known method for 
quantitating gene products can be used. 

The assays of the invention may also be used to evaluate the efficacy of a 

20 particular therapeutic treatment regime in animal studies, in clinical trials, or in monitoring 
the treatment of an individual patient. In these cases, it may be desirable to establish the 
baseline for the patient prior to commencing therapy and to repeat the assays one or more 
times through the course of treatment, usually on a regular basis, to evaluate whether hTRT 
levels are moving toward the desired endpoint (e.g., reduced expression of hTRT when the 

25 assay is for cancer) as a result of the treatment. 

One of skill will appreciate that, in addition to the quantity or abundance of 
hTRT gene products, variant or abnormal expression patterns (e.g., abnormal amounts of 
RNA splicing variants) or variant or abnormal expression products (e.g., mutated transcripts, 
truncated or non-sense polypeptides) may also be identified by comparison to normal 

30 expression levels and normal expression products. In these cases determination of "normal" 
or "baseline" involves identifying healthy organisms and/or tissues (i.e. organisms and/or 
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tissues without hTRT expression disregulation or neoplastic growth) and measuring 
expression levels of the variant hTRT gene products (e.g., splicing variants), or sequencing or 
detecting the hTRT gene, mRNA, or reverse transcribed cDNA to obtain or detect typical 
(normal) sequence variations. Application of standard statistical methods used in molecular 
5 genetics permits determination of significant deviations from such baseline levels. 

C) DETECTION AND QUANTITATION OF TRT GENE PRODUCTS 

As has been emphasized herein, hTRT gene products are usually found in 
most normal somatic cells at extremely low levels. For example, the mRNA encoding hTRT 

10 protein is extremely rare or absent in all telomerase-negative cell types studied thus far. In 
immortal cells, such as 293 cells, hTRT mRNA may be present at only about 1 00 copies per 
cell, while normal somatic cells may have as few as one or zero copies per cell. It will thus 
be apparent that, when highly sensitive assays for hTRT gene products are desired, it will 
sometimes be advantageous to incorporate signal or target amplification technologies into the 

15 assay format. See, for example, Plenat et al., 1997, Ann. Pathol 17:17 

(fluoresceinyl-tyramide signal amplification); Zehbeetal., 1997, J. Pathol. 150:1553 
(catalyzed reporter deposition); other references listed herein (e.g., for bDNA signal 
amplification, for PCR and other target amplification formats); and other techniques known 
in the art. 

20 As noted above, it is often unnecessary to quantitate the hTRT mRNA or 

protein in the assays disclosed herein, because the detection of an hTRT gene product (under 
assay conditions in which the product is not detectable in control, e.g., telomerase-negative 
cells) is in itself sufficient for a diagnosis. As another example, when the levels of product 
found in a test (e.g., tumor) and control (e.g., healthy cell) samples are directly compared, 

25 quantitation may be superfluous. 

When desired, however, quantities of hTRT gene product measured in the 
assays described herein may be described in a variety of ways, depending on the method of 
measurement and convenience. Thus, normal, diagnostic, prognostic, high or low q[uantities 
of hTRT protein/mRNA may be expressed as standard units of weight per quantity of 

30 biological sample (e.g., picograms per gram tissue, picograms per 10 12 cells), as a number of 
molecules per quantity of biological sample (e.g., transcripts/cell, moles/cell), as units of 
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activity per cell or per other unit quantity, or by similar methods. The quantity of hTRT gene 
product can also be expressed in relation to the quantity of another molecule; examples 
include: number of hTRT transcripts in sample/number of 28S rRNA transcripts in sample; 
nanograms of hTRT protein/ nanograms of total protein; and the like. 
5 When measuring hTRT gene products in two (or more) different samples, it 

will sometimes be usefixl to have a common basis of comparison for the two samples. For 
example, when comparing a sample of normal tissue and a sample of cancerous tissue, equal 
amounts of tissue (by weight, volume, number of cells, etc.) can be compared. Alternatively, 
equivalents of a marker molecule (e.g., 28S rRNA, hTR, telomerase activity, telomere length, 
1 0 actin) may be used. For example, the amount of hTRT protein in a healthy tissue sample 
containing 10 picograms of 28S rRNA can be compared to a sample of diseased tissue 
containing the same amount of 28S rRNA. 

It will also be recognized by those of skill that virtually any of the assays 
described herein can be designed to be quantitative. Typically, a known quantity or source of 
1 5 an hTRT gene product (e.g., produced using the methods and compositions of the invention) 
is used to calibrate the assay. 

In certain embodiments, assay formats are chosen that detect the presence, 
absence, or abundance of an hTRT allele or gene product in each cell in a sample (or in a 
representative sampling). Examples of such formats include those that detect a signal by 
20 histology (e.g., immunohistochemistry with signal-enhancing or target-enhancing 

amplification steps) or fluorescence-activated cell analysis or cell sorting (FACS). These 
formats are particularly advantageous when dealing with a highly heterogeneous cell 
population (e.g., containing multiple cells types in which only one or a few types have 
elevated hTRT levels, or a population of similar cells expressing telomerase at different 
25 levels). 



D) SAMPLE COLLECTION 

The hTRT gene or gene product (i.e., mRNA or polypeptide) is preferably 
detected and/or quantified in a biological sample. Such samples include, but are not limited 
30 to, cells (including whole cells, cell fractions, cell extracts, and cultured cells or cell lines), 
tissues (including blood, blood cells (e.g., white cells), and tissue samples such as fine needle 
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biopsy samples (e.g., from prostate, breast, thyroid, etc.)), body fluids (e.g., urine, sputum, 
amniotic fluid, blood, peritoneal fluid, pleural fluid, semen) or cells collected therefrom (e.g., 
bladder cells from urine, lymphocytes from blood), media (from cultured cells or cell lines), 
and washes (e.g., of bladder and lung). Biological samples may also include sections of 
5 tissues such as frozen sections taken for histological purposes. For cancer diagnosis and 
prognosis, a sample will be obtained from a cancerous or precancerous or suspected 
cancerous tissue or tumor. It will sometimes be desirable to freeze a biological sample for 
later analysis (e.g., when monitoring efficacy of drug treatments). 

In some cases, the cells or tissues may be fractionated before analysis. For 
1 0 example, in a tissue biopsy from a patient, a cell sorter (e.g., a fluorescence-activated cell 
sorter) may be used to sort cells according to characteristics such as expression of a surface 
antigen (e.g., a tumor specific antigen) according to well known methods. 

Although the sample is typically taken from a human patient or cell line, the 
assays can be used to detect hTRT homolog genes or gene products in samples from other 
1 5 animals. Alternatively, hTRT genes and gene products can be assayed in transgenic animals 
or organisms expressing a human TRT protein or nucleic acid sequence. 

The sample may be pretreated as necessary by dilution in an appropriate 
buffer solution or concentrated, if desired. Any of a number of standard aqueous buffer 
solutions, employing one of a variety of buffers, such as phosphate, Tris-buffer, or the like, at 
20 physiological pH can be used. 

A "biological sample" obtained from a patient can be referred to either as a 
"biological sample" or a "patient sample." It will be appreciated that analysis of a "patient 
sample" need not necessarily require removal of cells or tissue from the patient. For 
example, appropriately labeled hTRT-binding agents (e.g., antibodies or nucleic acids) can be 
25 injected into a patient and visualized (when bound to the target) using standard imaging 
technology (e.g., CAT, NMR, and the like.) 

E) NUCLEIC ACID ASSAYS 

In one embodiment, this invention provides for methods of detecting and/or 
30 quantifying expression of hTRT mRNAs (including splicing or sequence variants and 
alternative alleles). In an alternative embodiment, the invention provides methods for 
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detecting and analyzing normal or abnormal hTRT genes (or fragments thereof). The form of 
such qualitative or quantitative assays may include, but is not limited to, amplification-based 
assays with or without signal amplification, hybridization based assays, and combination 
amplification-hybridization assays. It will be appreciated by those of skill that the distinction 
5 between hybridization and amplification is for convenience only: as illustrated in the 
examples below, many assay formats involve elements of both hybridization and 
amplification, so that the categorization is somewhat arbitrary in some cases. 

1) PREPARATION OF NUCLEIC ACIDS 

10 In some embodiments, nucleic acid assays are performed with a sample of 

nucleic acid isolated from the cell, tissue, organism, or cell line to be tested. The nucleic acid 
(e.g., genomic DNA, RNA or cDNA) may be "isolated" from the sample according to any of 
a number of methods well known to those of skill in the art. In this context, "isolated" refers 
to any separation of the species or target to be detected from any other substance in the 

1 5 mixture, but does not necessarily indicate a significant degree of purification of the target. 
One of skill will appreciate that, where alterations in the copy number of the hTRT gene are 
to be detected, genomic DNA is the target to be detected. Conversely, where expression 
levels of a gene or genes are to be detected, RNA is the target to be detected in a nucleic 
acid-based assay. In one preferred embodiment, the nucleic acid sample is the total mRNA 

20 (i.e., poly(A) + RNA) in a biological sample. Methods for isolating nucleic acids are well 
known to those of skill in the art and are described, for example, Tijssen, P. ed. of 
Laboratory Techniques in Biochemistry and Molecular Biology: Hybridization 
With Nucleic Acid Probes, Part I. Theory and Nucleic Acid Preparation, Elsevier, 
N.Y. (1993) Chapt. 3, which is incorporated herein by reference. In one embodiment, the 

25 total nucleic acid is isolated from a given sample using an acid guanidinium-phenol- 
chloroform extraction method and poly(A)+ mRNA is isolated by oligo-dT column 
chromatography or by using (dT)n magnetic beads (see, e.g., Sambrook et al., and Ausubel et 
al., supra). 

In alternative embodiments, it is not necessary to isolate nucleic acids (e.g., 
30 total or polyA* RNA) from the biological sample prior to carrying out amplification, 

hybridization or other assays. These embodiments have certain advantages when hTRT RNA 

139 



ii 



'i| II 1 IN || 




is to be measured, because they reduce the possibility of loss of hTRT mRNA during 
isolation and handling. For example, many amplification techniques such as PCR and RT- 
PCR defined above can be carried out using permeabilized ceils (histological specimens and 
FACS analyses), whole lysed cells, or crude cell fractions such as certain cell extracts. 
5 Preferably, steps are taken to preserve the integrity of the target nucleic acid (e.g., mRNA) if 
necessary (e.g., addition of RNAase inhibitors). Amplification and hybridization assays can 
also be carried out in situ, for example, in thin tissue sections from a biopsy sample or from a 
cell monolayer (e.g., blood cells or disagregated tissue culture cells). Amplification can also 
be carried out in an intact whole cell or fixed cells. For example, PCR, RT-PCR, or LCR 

10 amplification methods may be carrier out, as is well known in the art, in situ, e.g., using a 
polymerase or ligase, a primer or primer(s), and (deoxy)ribonucleoside triphosphates (if a 
polymerase is employed), and reverse transcriptase and primer (if RNA is to be transcribed 
and the cDNA is to be detected) on fixed, permeabilized, or microinjected cells to amplify 
target hTRT RNA or DNA. Cells containing hTRT RNA (e.g., telomerase positive cells) or 

15 an hTRT DNA sequence of interest can then be detected. This method is often useful when 
fluorescently-labeled dNTPs, primers, or other components are used in conjunction with 
microscopy, FACS analysis or the equivalent 



amplification-based assays for detection of an hTRT gene or gene product. In an 
amplification based assay, all or part of an hTRT gene or transcript (e.g., mRNA or cDNA; 
hereinafter also referred to as "target") is amplified, and the amplification product is then 
detected directly or indirectly. When there is no underlying gene or gene product to act as a 

25 template, no amplification product is produced (e.g., of the expected size), or amplification is 
non-specific and typically there is no single amplification product In contrast, when the 
underlying gene or gene product is present, the target sequence is amplified, providing an 
indication of the presence and/or quantity of the underlying gene or mRNA. Target 
amplification-based assays are well known to those of skill in the art. 

30 The present invention provides a wide variety of primers and probes for 

detecting hTRT genes and gene products. Such primers and probes are sufficiently 
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2) AMPLIFICATION BASED ASSAYS 

In one embodiment, the assays of the present invention are 
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complementary to the hTRT gene or gene product to hybridize to the target nucleic acid. 
Primers are typically at least 6 bases in length, usually between about 10 and about 100 bases, 
typically between about 12 and about 50 bases, and often between about 14 and about 25 
bases in length. One of skill, having reviewed the present disclosure, will be able, using 
5 routine methods, to select primers to amplify all, or any portion, of the hTRT gene or gene 
product, or to distinguish between variant gene products, hTRT alleles, and the like. Table 2 
lists illustrative primers useful for PCR amplification of the hTRT, or specific hTRT gene 
products or regions. As is known in the art, single oligomers (e.g., U.S. Pat. No. 5,545,522), 
nested sets of oligomers, or even a degenerate pool of oligomers may be employed for 
10 amplification, e.g., as illustrated by the amplification of the Tetrahymena TRT cDNA as 
described infra. 

The invention provides a variety of methods for amplifying and detecting an 
hTRT gene or gene product, including the polymerase chain reaction (including all variants, 
e.g., reverse-transcriptase-PCR; the Sunrise Amplification System (Oncor, Inc, Gaitbersburg 

15 MD); and numerous others known in the art). In one illustrative embodiment, PCR 

amplification is carried out in a 50 pi solution containing the nucleic acid sample (e.g., cDNA 
obtained through reverse transcription of hTRT RNA), 100 |iM in each dNTP (dATP, dCTP, 
dGTP and dTTP; Pharmacia LKB Biotechnology, NJ), the hTRT-specific PCR primer(s), 1 
unit/ Taq polymerase (Perkin Elmer, Norwalk CT), lx PCR buffer (50 mM KC1, 10 mM Tris, 

20 pH 8.3 at room temperature, 1 .5 mM MgCl 2 , 0.01% gelatin) with the amplification run for 
about 30 cycles at 94° for 45 sec, 55° for 45 sec and 72° for 90 sec. However, as will be 
appreciated, numerous variations may be made to optimize the PCR amplification for any 
particular reaction. 

Other suitable target amplification methods include the ligase chain reaction 
25 (LCR; e.g., Wu and Wallace, 1989, Genomics 4:560; Landegren et al, 1988, Science, 241 : 
1077, Barany, 1991, Proc. Natl Acad Set USA 88:189 and Barringer et al, 1990, Gene, 89: 
1 17); strand displacement amplification (SDA; e.g., Walker et al., 1992, Proc. Natl Acad. 
Set U.S.A. 89:392-396); transcription amplification (e.g., Kwoh et al, 1989, Proc. Natl 
Acad. Set USA, 86: 1173); self-sustained sequence replication (3SR; e.g., Fahy et al., 1992, 
30 PCR Methods Appl 1:25, Guatelli et al., 1990, Proc. Nat Acad. Sci. USA, 87: 1874); the 
nucleic acid sequence based amplification (NASBA, Cangene, Mississauga, Ontario; e.g., 
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Compton, 1991, Nature 350:91); the transcription-based amplification system (TAS); and the 
self-sustained sequence replication system (SSR). Each of the aforementioned publications is 
incorporated herein by reference. One useful variant of PCR is PCR ELISA (e.g., Boehringer 
Mannheim Cat. No. 1 636 1 1 1) in which digoxigenin-dUTP is incorporated into the PCR 
product. The PCR reaction mixture is denatured and hybridized with a biotin-labeled 
oligonucleotide designed to anneal to an internal sequence of the PCR product. The 
hybridization products are immobilized on streptavidin coated plates and detected using anti- 
digoxigenin antibodies. Examples of techniques sufficient to direct persons of skill through 
in vitro amplification methods are found in PCR TECHNOLOGY: PRINCIPLES AND 
Applications for DNA Amplification, H. Erlich, Ed. Freeman Press, New York, NY 
(1 992); PCR Protocols: A Guide to Methods and Applications, eds. Innis, Gelfiand, 
Snisky, and White, Academic Press, San Diego, CA (1990); Mattila et al., 1991, Nucleic 
Acids Res. 19: 4967; Eckert and Kunkel, (1991) PCR Methods and Applications 1:17; 
PCR, eds. McPherson, Quirkes, and Taylor, IRL Press, Oxford; U.S. Patent Nos. 4,683,195, 
4,683,202, and 4,965,188; Barringer et al., 1990, Gene, 89:1 17; Lomell et al., 1989, J. Clin. 
Chem., 35:1826, each of which is incorporated herein for all purposes. 

Amplified products may be directly analyzed, e.g., by size as determined by 
gel electrophoresis; by hybridization to a target nucleic acid immobilized on a solid support 
such as a bead, membrane, slide, or chip; by sequencing; immunologically, e.g., by PCR- 
ELISA, by detection of a fluorescent, phosphorescent, or radioactive signal; or by any of a 
variety of other well-known means. For example, an illustrative example of a detection 
method uses PCR primers augmented with hairpin loops linked to fluorescein and a benzoic 
acid derivative that serves as a quencher, such that fluorescence is emitted only when the 
primers unfold to bind their targets and replication occurs. 

Because hTRT mRNA is typically expressed as an extremely rare transcript, 
present at very low levels even in telomerase positive cells, it is often desirable to optimize or 
increase the signal resulting from the amplification step. One way to do this is to increase the 
number of cycles of amplification. For example, although 20-25 cycles are adequate for 
amplification of most mRNAs using the polymerase chain reaction under standard reaction 
conditions, detection of hTRT mRNA in many samples can require as many as 30 to 35 
cycles of amplification, depending on detection format and efficiency of amplification. It 
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will be recognized that judicious choice of the amplification conditions including the number 
of amplification cycles can be used to design an assay that results in an amplification product 
only when there is a threshold amount of target in the test sample (i.e., so that only samples 
with a high level of hTRT mRNA give a "positive" result). In addition, methods are known 
to increase signal produced by amplification of the target sequence. Methods for augmenting 
the ability to detect the amplified target include signal amplification system such as: branched 
DNA signal amplification (e.g., U.S. Pat No. 5,124,246; Urdea, 1994, Bio/Tech 12:926); 
tyramide signal amplification (TSA) system (Du Pont); catalytic signal amplification (CSA; 
Dako); Q Beta Replicase systems (Tyagi et al., 1996, Proc. Nat. Acad Set USA, 93: 5395 ); 
or the like. 

One of skill in the art will appreciate that whatever amplification method is 
used, a variety of quantitative methods known in the art can be used if quantitation is; desired. 
For example, when desired, two or more polynucleotides can be co-amplified in a single 
sample. This method can be used as a convenient method of quantitating the amount of 
hTRT mRNA in a sample, because the reverse transcription and amplification reactions are 
carried out in the same reaction for a target and control polynucleotide. The co-amplification 
of the control polynucleotide (usually present at a known concentration or copy number) can 
be used for normalization to the cell number in the sample as compared to the amount of 
hTRT in the sample. Suitable control polynucleotides for co-amplification reactions include 
DNA, RNA expressed from housekeeping genes, constitutively expressed genes, and in vitro 
synthesized RNAs or DNAs added to the reaction mixture. Endogenous control 
polynucleotides are those that are already present in the sample, while exogenous control 
polynucleotides are added to a sample, creating a "spiked" reaction. Illustrative control 
RNAs include P-actin RNA, GAPDH RNA, snRNAs, hTR, and endogenously expressed 28S 
rRNA (see Khan et al., 1992, Neurosci. Lett. 147:1 14). Exogenous control polynucleotides 
include a synthetic AW106 cRNA, which may be synthesized as a sense strand from 
pAW106 by T7 polymerase. It will be appreciated that for the co-amplification method to be 
useful for quantitation, the control and target polynucleotides must typically both be 
amplified in a linear range. Detailed protocols for quantitative PCR may be found in PCR 
Protocols, A Guide to Methods and Applications, Innis et al., Academic Press, Inc. 
N.Y., (1990) and Ausubel et al., supra (Unit 15) and Diaco, R. (1995) Practical 
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Considerations for the Design of Quantitative PCR Assays, in PCR Strategies, pg. 84-108, 
Innis et al. eds, Academic Press, New York. 

Depending on the sequence of the endogenous or exogenous standard., 
different primer sets may be used for the co-amplification reaction. In one method, called 
competitive amplification, quantitative PCR involves simultaneously co-amplifying a known 
quantity of a control sequence using the same primers used for amplification of the target 
nucleic acid (one pair of 2 primers). In an alternative embodiment, known as non- 
competitive competition, the control sequence and the target sequence (e.g., hTRT cDNA) are 
amplified using different primers (i.e., 2 pairs of 2 primers). In another alternative 
embodiment, called semi-competitive amplification, three primers are used, one of which is 
hTRT-specific, one of which is control specific, and one of which is capable of annealing to 
both the target and control sequences. Semi-competitive amplification is described in U.S. 
Patent No. 5,629,154, which is incorporated herein by reference. 

3) HYBRIDIZATION-BASED ASSAYS 
a) GENERALLY 

A variety of methods for specific DNA and RNA measurement using nucleic 
acid hybridization techniques are known to those of skill in the art (see Sambrook et al., 
supra). Hybridization based assays refer to assays in which a probe nucleic acid is hybridized 
to a target nucleic acid. Usually the nucleic acid hybridization probes of the invention are 
entirely or substantially identical to a contiguous sequence of the hTRT gene or RNA 
sequence. Preferably, nucleic acid probes are at least about 10 bases, often at least aibout 20 
bases, and sometimes at least about 200 bases or more in length. Methods of selecting 
nucleic acid probe sequences for use in nucleic acid hybridization are discussed in Sambrook 
et aL, supra. In some formats, at least one of the target and probe is immobilized. The 
immobilized nucleic acid may be DNA, RNA, or another oligo- or poly-nucleotide, and may 
comprise natural or non-naturally occurring nucleotides, nucleotide analogs, or backbones. 
Such assays may be in any of several formats including: Southern, Northern, dot and slot 
blots, high-density polynucleotide or oligonucleotide arrays (e.g., GeneChips™ Affymetrix), 
dip sticks, pins, chips, or beads. All of these techniques are well known in the art and are the 
basis of many commercially available diagnostic kits. Hybridization techniques are generally 
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described in Haines et al, ed., Nucleic Acid Hybridization, A Practical Approach IRL 
Press, (1985); Gall and Pardue Proc. Natl. Acad. ScL, U.S.A., 63: 378-383 (1969); and John 
et al., Nature, 223: 582-587 (1969). 

A variety of nucleic acid hybridization formats are known to those skilled in 
5 the art. For example, one common format is direct hybridization, in which a target nucleic 
acid is hybridized to a labeled, complementary probe. Typically, labeled nucleic acids are 
used for hybridization, with the label providing the detectable signal. One method for 
evaluating the presence, absence, or quantity of hTRT mRNA is carrying out a Northern 
transfer of RNA from a sample and hybridization of a labeled hTRT specific nucleic acid 

1 0 probe, as illustrated in Example 2. As was noted supra, hTRT mRNA, when present at all, is 
present in very low quantities in most cells. Therefore, when Northern hybridization is used, 
it will often be desirable to use an amplification step (or, alternatively, large amounts of 
starting RNA). A useful method for evaluating the presence, absence, or quantity of DNA 
encoding hTRT proteins in a sample involves a Southern transfer of DNA from a sample and 

1 5 hybridization of a labeled hTRT specific nucleic acid probe. 

Other common hybridization formats include sandwich assays and 
competition or displacement assays. Sandwich assays are commercially useful hybridization 
assays for detecting or isolating nucleic acid sequences. Such assays utilize a "capture" 
nucleic acid covalently immobilized to a solid support and a labeled "signal" nucleic acid in 

20 solution. The biological or clinical sample will provide the target nucleic acid. The "capture" 
nucleic acid and "signal" nucleic acid probe hybridize with the target nucleic acid to form a 
"sandwich" hybridization complex. To be effective, the signal nucleic acid cannot hybridize 
with the capture nucleic acid. 

25 b) CHIP-BASED AND SLIDE-BASED ASSAYS 

The present invention also provides probe-based hybridization assays for 
hTRT gene products employing arrays of immobilized oligonucleotide or polynucleotides to 
which an hTRT nucleic acid can hybridize (i.e., to some, but usually not all or even most, of 
the immobilized oligo- or poly-nucleotides). High density oligonucleotide arrays or 
30 polynucleotide arrays provide a means for efficiently detecting the presence and 

characteristics (e.g., sequence) of a target nucleic acid (e.g., hTRT gene, mRNA, or cDNA). 
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Techniques are known for producing arrays containing thousands of oligonucleotides 
complementary to defined sequences, at defined locations on a surface using 
photolithographic techniques for synthesis in situ (see, e.g., U.S. Patent Nos. 5,578,832; 
5,556,752; and 5,510,270; Fodor et al., 1991, Science 251:767; Pease et al., 1994, Proc. Natl. 
5 Acad. Sci. USA 91:5022; and Lockhart et al., 1996, Nature Biotech 14:1675) or other methods 
for rapid synthesis and deposition of defined oligonucleotides (Blanchard et al., 1996, 
Biosensors & Bioelectronics 1 1:687). When these methods are used, oligonucleotides (e.g., 
20-mers) of known sequence are synthesized directly on a surface such as a derivatized glass 
slide. Usually, the array produced is redundant, having several oligonucleotide probes on the 
1 0 chip specific for the hTRT polynucleotide to be detected. 

Combinations of oligonucleotide probes can be designed to detect 
alternatively spliced mRNAs, or to identify which of various hTRT alleles is expressed in a 
particular sample. 

In one illustrative embodiment, cDNA prepared by reverse transcription of 
1 5 total RNA from a test cell is amplified (e.g., using PCR). Typically the amplification product 
is labeled, e.g., by incorporation of a fluorescently labeled dNTP. The labeled cDNAs are 
then hybridized to a chip comprising oligonucleotide probes complementary to various 
subsequences of the hTRT gene. The positions of hybridization are determined (e.g., in 
accordance with the general methods of Shalon et al., 1996, Genome Research 6:639 or 
20 Schena et al., 1996, Genome Res. 6:639), and sequence (or other information) deduced from 
the hybridization pattern, by means well known in the art. 

In one embodiment, two cDNA samples, each labeled with a different 
fluorescent group, are hybridized to the same chip. The ratio of the hybridization of each 
labeled sample to sites complementary to the hTRT gene are then assayed. If both samples 
25 contain the same amount of hTRT mRNA, the ratio of the two fluors will be 1 : 1 (it will be 
appreciated that the signal from the fluors may need to be adjusted to account for any 
difference in the molar sensitivity of the fluors). In contrast, if one sample is from a healthy 
(or control) tissue and the second sample is from a cancerous tissue the fluor used in the 
second sample will predominate. 

30 

c) IN SITU HYBRIDIZATION 
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An alternative means for detecting expression of a gene encoding an hTRT 
protein is in situ hybridization. In situ hybridization assays are well known and are generally 
described in Angerer et al., Methods Enzymol., 152: 649-660 (1987) and Ausubel et al., 
supra. In an in situ hybridization assay, cells or tissue specimens are fixed to a solid support, 
5 typically in a permeablilized state, typically on a glass slide. The cells are then contacted 

with a hybridization solution at a moderate temperature to permit annealing of labeled nucleic 
acid probes (e.g., 35 S-labeled riboprobes, fluorescently labeled probes) completely or 
substantially complementary to hTRT. Free probe is removed by washing and/or nuclease 
digestion, and bound probe is visualized directly on the slide by autoradiography or an 
1 0 appropriate imaging techniques, as is known in the art. 

4) SPECIFIC DETECTION OF VARIANTS 

As noted supra and illustrated in the Examples (e.g., Example 9), 
amplification primers or probes can be selected to provide amplification products that span 

15 specific deletions, truncations, and insertions, thereby facilitating the detection of specific 
variants or abnormalities in the hTRT mRNA. 

One example of an hTRT variant gene product that may be detected is an 
hTRT RNA such as a product (SEQUENCE ID NO: 4) described supra and in Example 9. 
The biological function, if any, of the A 1 82 variants) is not known; however, the truncated 

20 hTRT protein putatively encoded by the variant may be involved in regulation of telomerase 
activity, e.g., by assembling a non-functional telomerase RNP that titrates telomerase 
components. Alternatively, negative regulation of telomerase activity could be accomplished 
by directing hTRT pre-mRNA (nascent mRNA) processing in a manner leading to 
elimination of the full length mRNA and reducing hTRT mRNA levels and increasing A 182 

25 hTRT RNA levels. For these and other reasons, the ability to detect Al 82 variants is useful. 
In addition, it will sometimes be desirable, in samples in which two species of hTRT RNA 
are present (such as a A 182 hTRT RNA and hTRT RNA encoding the full-length hTRT 
protein) to compare their relative and/or absolute abundance. 

The invention provides a variety of methods for detection of Al 82 variants. 

30 For example, amplification using primer pairs spanning the 1 82 basepair deletion will result 
in different sized products corresponding to the deleted and undeleted hTRT RNAs, if both 
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are present, which can be distinguished on the basis of size (e.g., by gel electrophoresis). 
Examples of primer pairs useful for amplifying the region spanning the 1 82 bp deletion 
include TCPL14 and TCP1.15 (primer set 1), or TCP1.25 and bTCP6 (primer set 2) (see 
Table 2). These primer pairs can be used individually or in a nested PCR experiment where 
5 primer set 1 is used first. It will also be apparent to one of skill that hybridization methods 
(e.g., Northern hybridization) or RNAse protection assays using an hTRT nucleic acid probe 
of the invention can be used to detect and distinguish hTRT RNA variants. 



three primers. Analogous to the semi-competitive quantitative PCR method described in 
1 0 greater detail supra, one primer is specific to each of the hTRT RNA species (e.g., as 

illustrated in Table 4) and one primer is complementary to both species (e.g., TCP 1.25 (2270- 
2288)). An example of a primer specific to SEQUENCE ID NO:. 1 is one that anneals within 
the 182 nucleotide sequence (i.e., nucleotides 2345 to 2526 of SEQUENCE ID NO: 1), e.g., 
TCP 1.73 (2465-2445). For example, a primer specific to SEQUENCE ID NO: 4 (a A 182 
1 5 variant) is one that anneals at nucleotides 2358 to 2339 of SEQUENCE ID NO: 4 (i.e., the 
site corresponding to the 182 nucleotide insertion in SEQUENCE ID NO: 1). The absolute 
abundance of the A 182 hTRT mRNA species or its relative abundance compared to the 
species encoding the full-length hTRT protein can be analyzed for correlation to cell state 
(e.g., capacity for indefinite proliferation). It will be appreciated that numerous other primers 
20 or amplification or detection methods can be selected based on the present disclosure. 



Another suitable method entails PCR amplification (or the equivalent) using 



TABLE 4 



ILLUSTRATIVE PRIMERS 



A182 species (e.g 



♦ / 



SEQUENCE ID NO. 4) specific primer: 



25 



5 1 - GGCACTGGACGTAGGACGTG - 3 



hTRT (SEQUENCE ID NO. 1) specific primer (TCP1.73) : 



5 ■ - CACTGCTGGCCTCATTCAGGG- 3 



Common (forward) primer (TCP1.25): 



5 ' - TACTGCGTGCGTCGGTATG - 3 1 



30 



Other variant hTRT genes or gene products that can be detected include those 
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characterized by premature stop codons, deletions, substitutions or insertions. Deletions can 
be detected by the decreased size of the gene, mRNA transcript, or cDNA. Similarly, 
insertions can be detected by the increased size of the gene, mRNA transcript, or cDNA 
Insertions and deletions could also cause shifts in the reading frame that lead to premature 
stop codons or longer open reading frames. Substitutions, deletions, and insertions can also 
be detected by probe hybridization. Alterations can also be detected by observing changes in 
the size of the variant hTRT polypeptide (e.g., by Western analysis) or by hybridization or 
specific amplification as appropriate. Alternatively, mutations can be determined by 
sequencing of the gene or gene product according to standard methods. In addition, sind as 
noted above, amplification assays and hybridization probes can be selected to target particular 
abnormalities specifically. For example, nucleic acid probes or amplification primers can be 
selected that specifically hybridize to or amplify, respectively, the region encompassing the 
deletion, substitution, or insertion. Where the hTRT gene harbors such a mutation, the probe 
will either (1) fail to hybridize or the amplification reaction will fail to provide specific 
amplification or cause a change in the size of the amplification product or hybridization 
signal; or (2) the probe or amplification reaction encompasses the entire deletion or either end 
of the deletion (deletion junction); or (3) similarly, probes and amplification primers can be 
selected that specifically target point mutations or insertions. 

5) DETECTION OF MUTANT hTRT ALLELES 

Mutations in the hTRT gene can be responsible for disease initiation or can 
contribute to a disease condition. Alterations of the genomic DNA of hTRT can affect levels 
of gene transcription, change amino acid residues in the hTRT protein, cause truncated hTRT 
polypeptides to be produced, alter pre-mRNA processing pathways (which can alter hTRT 
mRNA levels), and cause other consequences as well. 

Alterations of genomic DNA in non-hTRT loci can also affect expression of 
hTRT or telomerase by altering the enzymes or cellular processes that are responsible for 
regulating hTRT, hTR, and telomerase-associated protein expression and processing and RNP 
assembly and transport. Alterations which affect hTRT expression, processing, or RNP 
assembly could be important for cancer progression, for diseases of aging, for DNA damage 
diseases, and others. 
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Detection of mutations in hTRT mRNA or its gene and gene control elements 
can be accomplished in accordance with the methods herein in multiple ways. Illustrative 
examples include the following: A technique termed primer screening can be employed; 
PCR primers are designed whose 3' termini anneal to nucleotides in a sample DNA (or RNA) 
5 that are possibly mutated. If the DNA (or RNA) is amplified by the primers, then the 3' 
termini matched the nucleotides in the gene; if the DNA is not amplified, then one or both 
termini did not match the nucleotides in the gene, indicating a mutation was present Similar 
primer design can be used to assay for point mutations using the Ligase Chain Reaction 
(LCR, described supra). Restriction fragment length polymorphism, RFLP (Pourzand, C, 

10 Cerutti, P. (1993) Mutat. Res 288: 1 13-121), is another technique that can be applied in the 
present method. A Southern blot of human genomic DNA digested with various restriction 
enzymes is probed with an hTRT specific probe. Differences in the fragment number or sizes 
between the sample and a control indicate an alteration of the experimental sample, usually 
an insertion or deletion. Single strand conformation polymorphism, SSCP (Orrita, M., et al. 

15 (1989) PNAS USA 86:2766-70), is another technique that can be applied in the present 
method. SSCP is based on the differential migration of denatured wild-type and mutant 
single-stranded DNA (usually generated by PCR). Single-stranded DNA will take on a 
three-dimensional conformation that is sequence-specific. Sequence differences as small as a 
single base change can result in a mobility shift on a nondenaturing gel. SSCP is one of the 

20 most widely used mutation screening methods because of its simplicity. Denaturing Gradient 
Gel Electrophoresis, DGGE (Myers, R. M., Maniatis, T. and Lerman, L., (1987) Methods in 
Enzymology, 155: 501-527), is another technique that can be applied in the present method. 
DGGE identifies mutations based on the melting behavior of double-stranded DNA. 
Specialized denaturing electrophoresis equipment is utilized to observe the melting profile of 

25 experimental and control DNAs: a DNA containing a mutation will have a different mobility 
compared to the control in these gel systems. The examples discussed illustrate commonly 
employed methodology; many other techniques exist which are known by those skilled in the 
art and can be applied in accordance with the teachings herein. 

30 F. KARYOTYPE ANALYSIS 

The present invention further provides methods and reagents for karyotype or 
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other chromosomal analysis using hTRT-sequence probes and/or detecting or locating hTRT 
gene sequences in chromosomes from a human patient, human cell line, or non-human cell. 
In one embodiment, amplification (i.e., change in copy number), deletion (i.e., partial 
deletion), insertion, substitution, or changes in the chromosomal location (e.g., translocation) 
5 of an hTRT gene may be correlated with the presence of a pathological condition or a 
predisposition to developing a pathological condition (e.g., cancer). 

It has been determined by the present inventors that, in normal human cells, 
the hTRT gene maps close to the telomere of chromosome 5p (see Example 5, infra). The 
closest STS marker is D5S678 (see Figure 8). The location can be used to identify markers 

1 0 that are closely linked to the hTRT gene. The markers can be used to identify YACs, STSs, 
cosmids, BACs, lambda or PI phage, or other clones which contain hTRT genomic 
sequences or control elements. The markers or the gene location can be used to scan human 
tissue samples for alterations in the normal hTRT gene location, organization or sequence that 
is associated with the occurrence of a type of cancer or disease. This information can be used 

15 in a diagnostic or prognostic manner for the disease or cancer involved. Moreover, the nature 
of any alterations to the hTRT gene can be informative as to the nature by which cells 
become immortal. For instance, a translocation event could indicate that activation of hTRT 
expression occurs in some cases by replacing the hTRT promoter with another promoter 
which directs hTRT transcription in an inappropriate manner. Methods and reagents of the 

20 invention of this type can be used to inhibit hTRT activation. The location may also be 
useful for determining the nature of hTRT gene repression in normal somatic cells, for 
instance, whether the location is part of non-expressing heterochromatin. Nuclease 
hypersensitivity assays for distinguishing heterochromatin and euchromatin are described, for 
example, in Wu et al., 1979, Cell 16:797; Groudine and Weintraub, 1982, Cell 30:13 1 Gross 

25 and Garrard, 1988, Ann. Rev. Biochem. 57:159. 

In one embodiment, alterations to the hTRT gene are identified by karyotype 
analysis, using any of a variety of methods known in the art. One useful technique is in 
situ hybridization (ISH). Typically, when in situ hybridization techniques are used for 
karyotype analysis, a detectable or detectably-labeled probe is hybridized to a chromosomal 

30 sample in situ to locate an hTRT gene sequence. Generally, ISH comprises one or more of 
the following steps: (1) fixation of the tissue, cell or other biological structure to be analyzed; 
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(2) prehybridization treatment of the biological structure to increase accessibility of target 
DNA (e.g., denaturation with heat or alkali), and to reduce nonspecific binding (e.g., by 
blocking the hybridization capacity of repetitive sequences, e.g., using human genomic 
DNA); (3) hybridization of one or more nucleic acid probes (e.g., conventional nucleic acids, 
5 PNAs, or probes containing other nucleic acid analogs) to the nucleic acid in the biological 
structure or tissue; (4) posthybridization washes to remove nucleic acid fragments not bound 
in the hybridization; and, (5) detection of the hybridized nucleic acid fragments. The reagents 
used in each of these steps and conditions for their use vary depending on the particular 
application. It will be appreciated that these steps can be modified in a variety of ways well 
1 0 known to those of skill in the art. 

In one embodiment of ISH, the hTRT probe is labeled with a fluorescent label 
(fluorescent in situ hybridization; "FISH"). Typically, it is desirable to use dual color 
fluorescent in situ hybridization, in which two probes are utilized, each labeled by a different 
fluorescent dye. A test probe that hybridizes to the hTRT sequence of interest is labeled with 
1 5 one dye, and a control probe that hybridizes to a different region is labeled with a second dye. 
A nucleic acid that hybridizes to a stable portion of the chromosome of interest, such as the 
centromere region, can be used as the control probe. In this way, one can account for 
differences between efficiency of hybridization from sample to sample. 

The ISH methods for detecting chromosomal abnormalities (e.g., FISH) can be 
20 performed on nanogram quantities of the subject nucleic acids. Paraffin embedded normal 
tissue or tumor sections can be used, as can fresh or frozen material, tissues, or sections. 
Because FISH can be applied to limited material, touch preparations prepared from 
uncultured primary tumors can also be used (see, e.g., Kallioniemi et al., 1992, Cytogenet. 
Cell Genet. 60:190). For instance, small biopsy tissue samples from tumors can be used for 
25 touch preparations (see, e.g., Kallioniemi et al., supra). Small numbers of cells obtained from 
aspiration biopsy or cells in bodily fluids (e.g., blood, urine, sputum and the like) can also be 
analyzed. For prenatal diagnosis, appropriate samples will include amniotic fluid, maternal 
blood, and the like. Useful hybridization protocols applicable to the methods and reagents 
disclosed here are described in Pinkel et al., 1988, Proc. Natl. Acad. Sci. USA, 85:9138; EPO 
30 Pub. No. 430,402; Choo, ed., Methods in Molecular Biology Vol. 33 : In Situ 

Hybridization Protocols, Humana Press, Totowa, New Jersey, (1994); and Kallioniemi et 
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aL, supra. 

Other techniques useful for karyotype analysis include, for example, 
techniques such as quantitative Southern blotting, quantitative PCR, or comparative genomic 
hybridization (Kallioniemi et aL, 1992, Science, 258:818), using the hTRT probes and 
5 primers of the invention which may be used to identify amplification, deletion, insertion, 
substitution or other rearrangement of hTRT sequences in chromosomes in a biological 
sample. 



G. TRT POLYPEPTIDE ASSAYS 
10 1) GENERALLY 

The present invention provides methods and reagents for detecting and 
quantitating hTRT polypeptides. These methods include analytical biochemical methods 
such as electrophoresis, mass spectroscopy, gel shift, capillary electrophoresis, 
chromatographic methods such as size exclusion chromatography, high performance liquid 

1 5 chromatography (HPLC), thin layer chromatography (TLC), hyperdiffusion chromatography, 
and the like, or various immunological methods such as fluid or gel precipitin reactions, 
immunodiffusion (single or double), immunoelectrophoresis, radioimmunoassay (RIA), 
enzyme-linked immunosorbent assays (ELISAs), immunofluorescent assays, western 
blotting, mass spectrometry, and others described below and apparent to those of skill in the 

20 art upon review of this disclosure. 



2) ELECTROPHORETIC ASSAYS 

In one embodiment, the hTRT polypeptides are detected in an electrophoretic 
protein separation; in one aspect, a two-dimensional electrophoresis system is employed. 
25 Means of detecting proteins using electrophoretic techniques are well known to those of skill 
in the art (see generally, R. Scopes (1982) Protein Purification, Springer-Verlag, N.Y.; 
Deutscher, (1990) Methods in Enzymology Vol. 1 82: Guide to Protein Purification, 
Academic Press, Inc., N.Y.). 

In a related embodiment, a mobility shift assay (see, e.g., Ausubel et aL, 
30 supra) is used* For example, labeled-hTR will associate with hTRT and migrate with altered 
mobility upon electrophoresis in a nondenaturing polyacrylamide gel or the like. Thus, for 
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example, if an (optionally labeled) hTR probe or a (optionally labeled) telomerase primer is 
mixed with a sample containing hTRT, or coexpressed with hTRT (e.g., in a cell-free 
expression system) the presence of hTRT protein (or a polynucleotide encoding hTRT) in the 
sample will result in a detectable alteration of hTR mobility. 

5 

3) IMMUNOASSAYS 

a) GENERALLY 

The present invention also provides methods for detection of hTRT 
polypeptides employing one or more antibody reagents of the invention (i.e., immunoassays). 

10 As used herein, an immunoassay is an assay that utilizes an antibody (as broadly defined 
herein and specifically includes fragments, chimeras and other binding agents) that 
specifically binds an hTRT polypeptide or epitope. Antibodies of the invention may be made 
by a variety of means well known to those of skill in the art, e.g., as described supra. 

A number of well established immunological binding assay formats suitable 

15 for the practice of the invention are known (see, e.g., U.S. Patents 4,366,241; 4,376,110; 

4,517,288; and 4,837,168). See, e.g., Methods in Cell Biology Volume 37: Antibodies 
in Cell Biology, Asai, ed. Academic Press, Inc. New York (1993); Basic and Clinical 
Immunology 7th Edition, Stites & Terr, eds. (1991); Harlow and Lane, supra [e.g., Chapter 
14], and Ausubel et aL, supra, [e.g., Chapter 1 1], each of which is incorporated by reference 

20 in its entirety and for all purposes. Typically, immunological binding assays (or 

immunoassays) utilize a "capture agent" to specifically bind to and, often, immobilize the 
analyte. In one embodiment, the capture agent is a moiety that specifically binds to an hTRT 
polypeptide or subsequence, such as an anti-hTRT antibody. In an alternative embodiment, 
the capture agent may bind an hTRT-associated protein or RNA under conditions in which 

25 the hTRT-associated molecule remains bound to the hTRT (such that if the hTRT-associated 
molecule is immobilized the hTRT protein is similarly immobilized). It will be understood 
that in assays in which an hTRT-associated molecule is captured the associated hTRT protein 
will usually be present and so can be detected, e.g., using an anti-hTRT antibody or the like. 
Immunoassays for detecting protein complexes are known in the art (see, e.g., Harlow and 

30 Lane, supra, at page 583). 

Usually the hTRT gene product being assayed is detected directly or indirectly 
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using a detectable label The particular label or detectable group used in the assay is usually 
not a critical aspect of the invention, so long as it does not significantly interfere with the 
specific binding of the antibody or antibodies used in the assay. The label may be covalently 
attached to the capture agent (e.g., an anti-TRT antibody), or may be attached to a third 
moiety, such as another antibody, that specifically binds to, e.g., : the hTRT polypeptide (at a 
different epitope than recognized by the capture agent), the capture agent (e.g., an anti-(first 
antibody) immunoglobulin); an anti-TRT antibody; an antibody that binds an anti-TRT 
antibody; or, an antibody/telomerase complex (e.g., via binding to an associated molecule 
such as a telomerase-associated protein). Other proteins capable of binding an antibody used 
in the assay, such as protein A or protein G, may also be labeled. In some embodiments, it 
will be useful to use more than one labeled molecule (i.e., ones that can be distinguished from 
one another). In addition, when the target bound (e.g., immobilized) by the capture agent 
(e.g., anti-hTRT antibody) is a complex (i.e., a complex of hTRT and a TRT-associated 
protein, hTR, or other TRT associated molecule), a labeled antibody that recognizes the 
protein or RNA associated with the hTRT protein can be used. When the complex is a 
protein-nucleic acid complex (e.g., TRT-hTR), the reporter molecule can be a polynucleotide 
or other molecule (e.g., enzyme) that recognizes the RNA component of the complex. 

Some immunoassay formats do not require the use of labeled components. 
For instance, agglutination assays can be used to detect the presence of the target antibodies. 
In this case, antigen-coated particles are agglutinated by samples comprising the target 
antibodies. In this format, the components do not need to be labeled, and the presence of the 
target antibody can be detected by simple visual inspection. 

b) NON-COMPETITIVE ASSAY FORMATS 
The present invention provides methods and reagents for competitive and 
noncompetitive immunoassays for detecting hTRT polypeptides. Noncompetitive 
immunoassays are assays in which the amount of captured analyte (in this case hTRT) is 
directly measured. One such assay is a two-site, monoclonal-based immunoassay utilizing 
monoclonal antibodies reactive to two non-interfering epitopes on the hTRT protein . See, 
e.g., Maddox et aL, 1983, J. Exp. Med, 158:121 1 for background information. In one 
preferred "sandwich" assay, the capture agent (e.g., an anti-TRT antibody) is bound directly 
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to a solid substrate where it is immobilized. These immobilized antibodies then capture any 
hTRT protein present in the test sample. The hTRT thus immobilized can then be labeled, 
i.e., by binding to a second anti-hTRT antibody bearing a label. Alternatively, the second 
anti-hTRT antibody may lack a label, but be bound by a labeled third antibody specific to 
antibodies of the species from which the second antibody is derived. The second antibody 
alternatively can be modified with a detectable moiety, such as biotin, to which a third 
labeled molecule can specifically bind, such as enzyme-labeled streptavidin. 

c) COMPETITIVE ASSAY FORMATS 

In competitive assays, the amount of hTRT protein present in the sample is 
measured indirectly by measuring the amount of an added (exogenous) hTRT displaced (or 
competed away) from a capture agent (e.g., anti-TRT antibody) by the hTRT protein present 
in the sample. In one competitive assay, a known amount of labeled hTRT protein is added 
to the sample and the sample is then contacted with a capture agent (e.g., an antibody that 
specifically binds hTRT protein). The amount of exogenous (labeled) hTRT protein bound to 
the antibody is inversely proportional to the concentration of hTRT protein present in the 
sample. In one embodiment, the antibody is immobilized on a solid substrate. The amount 
of hTRT protein bound to the antibody may be determined either by measuring the amount of 
hTRT protein present in a TRT/antibody complex, or alternatively by measuring the amount 
of remaining uncomplexed TRT protein. The amount of hTRT protein may be detected by 
providing a labeled hTRT molecule. 

A hapten inhibition assay is another example of a competitive assay. In this 
assay hTRT protein is immobilized on a solid substrate. A known amount of anti-TRT 
antibody is added to the sample, and the sample is then contacted with the immobilized hTRT 
protein. In this case, the amount of anti-TRT antibody bound to the immobilized hTRT 
protein is inversely proportional to the amount of hTRT protein present in the sample. The 
amount of immobilized antibody may be detected by detecting either the immobilized 
fraction of antibody or the fraction of the antibody that remains in solution. In this aspect, 
detection may be direct, where the antibody is labeled, or indirect where the label is bound to 
a molecule that specifically binds to the antibody as described above. 
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d) OTHER ASSAY FORMATS 
The invention also provides reagents and methods for detecting and 
quantifying the presence of hTRT in the sample by using an immunoblot (Western blot) 
format. In this format, hTRT polypeptides in a sample are separated from other sample 
5 components by gel electrophoresis (e.g., on the basis of molecular weight), the separated 
proteins are transferred to a suitable solid support (such as a nitrocellulose filter, a nylon 
filter, derivatized nylon filter, or the like), and the support is incubated with anti-TRT 
antibodies of the invention. The anti-TRT antibodies specifically bind to hTRT or other TRT 
on the solid support. These antibodies may be directly labeled or alternatively may be 
10 subsequently detected using labeled antibodies (e.g., labeled sheep anti-mouse antibodies) or 
other labeling reagents that specifically bind to the anti-TRT antibody. 

Other assay formats include liposome immunoassays (LIA), which use 
liposomes designed to bind specific molecules (e.g., antibodies) and release encapsulated 
reagents or markers. The released chemicals can then be detected according to standard 
15 techniques (see, Monroe et al, 1986, Amer. Clin, Prod. Rev. 5:34). 

As noted supra, assay formats using FACS (and equivalent instruments or 
methods) have advantages when measuring hTRT gene products in a heterogeneous sample 
(such as a biopsy sample containing both normal and malignant cells). 

20 e) SUBSTRATES, SOLID SUPPORTS, MEMBRANES, FILTERS 

As noted supra, depending upon the assay, various components, including the 
antigen, target antibody, or anti-hTRT antibody, may be bound to a solid surface or support 
(i.e., a substrate, membrane, or filter paper). Many methods for immobilizing biomolecules 
to a variety of solid surfaces are known in the art. For instance, the solid surface may be a 

25 membrane (e.g., nitrocellulose), a microtiter dish (e.g., PVC, polypropylene, or polystyrene), 
a test tube (glass or plastic), a dipstick (e.g. glass, PVC, polypropylene, polystyrene, latex, 
and the like), a microcentrifuge tube, or a glass or plastic bead. The desired component may 
be covalently bound or noncovalently attached through nonspecific bonding. 

A wide variety of organic and inorganic polymers, both natural and synthetic 

30 may be employed as the material for the solid surface. Illustrative polymers include 
polyethylene, polypropylene, poly(4-methylbutene), polystyrene, polymethacrylate, 
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poly(ethylene terephthalate), rayon, nylon, poly(vinyl butyrate), polyvinylidene difluoride 
(PVDF), silicones, polyformaldehyde, cellulose, cellulose acetate, nitrocellulose, and the like. 
Other materials which may be employed, include paper, glasses, ceramics, metals, metalloids, 
semiconductive materials, cements or the like. In addition, substances that form gels, such as 
5 proteins (e.g., gelatins), lipopoly saccharides, silicates, agarose and polyacrylamides can be 
used. Polymers which form several aqueous phases, such as dextrans, polyalkylene glycols 
or surfactants, such as phospholipids, long chain (12-24 carbon atoms) alkyl ammonium salts 
and the like are also suitable. Where the solid surface is porous, various pore sizes may be 
employed depending upon the nature of the system. 

10 In preparing the surface, a plurality of different materials may be employed, 

particularly as laminates, to obtain various properties. For example, protein coatings, such as 
gelatin can be used to avoid non-specific binding, simplify covalent conjugation, enhance 
signal detection or the like. 

If covalent bonding between a compound and the surface is desired, the 

15 surface will usually be poly functional or be capable of being polyfunctionalized. Functional 
groups which may be present on the surface and used for linking can include carboxylic 
acids, aldehydes, amino groups, cyano groups, ethylenic groups, hydroxyl groups, mercapto 
groups and the like. The manner of linking a wide variety of compounds to various surfaces 
is well known and is amply illustrated in the literature. See, for example, Immobilized 

20 Enzymes, Ichiro Chibata, Halsted Press, New York, 1978, and Cuatrecasas (1970) J. Biol 
Chem. 245 3059). 

In addition to covalent bonding, various methods for noncovalently binding an 
assay component can be used. Noncovalent binding is typically nonspecific absorption of a 
compound to the surface. 

25 One of skill in the art will appreciate that it is often desirable to reduce non- 

specific binding in immunoassays. Particularly, where the assay involves an antigen or 
antibody immobilized on a solid substrate it is desirable to minimize the amount of non- 
specific binding to the substrate. Means of reducing such non-specific binding are well 
known to those of skill in the art. Typically, this involves coating the substrate with a 

30 proteinaceous composition. In particular, protein compositions such as bovine serum 

albumin (BSA), nonfat powdered milk, and gelatin are widely used with powdered milk 
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sometimes preferred. Alternatively, the surface is designed such that it nonspecifically binds 
one component but does not significantly bind another. For example, a surface bearing a 
lectin such as Concanavalin A will bind a carbohydrate containing compound but not a 
labeled protein that lacks glycosylation. Various solid surfaces for use in noncovalent 
5 attachment of assay components are reviewed in U.S. Patent Nos. 4,447,576 and 4,254,082. 

H) ASSAYS FOR ANTI-TRT ANTIBODIES 

The present invention also provides reagents and assays for detecting 
hTRT-specific immunoglobulins. In one embodiment, immobilized hTRT (e.g., recombinant 

10 hTRT bound to a microassay plate well) is incubated with serum from a patient under 

conditions in which anti-hTRT antibodies, if present, bind the immobilized hTRT. After 
washing to remove nonspecifically bound immunoglobulin, bound serum antibodies can be 
detected, if they are present, by adding detectably labeled anti-(human Ig) antibodies 
(alternative embodiments and variations are well known to those of skill in the art; see, e.g., 

15 Harlow, supra, at Ch. 14). These assays are useful for detecting anti-hTRT antibodies in any 
source including animal or human serum or a carrier such as saline. In one embodiment, the 
assays are used to detect or monitor an immune response to hTRT proteins in a patient, 
particularly an autoimmune (e.g., anti-telomerase) response. Anti-hTRT antibodies may be 
present in the serum or other tissues or fluids from a patient suffering from an autoimmune 

20 disease or other condition. 

I) ASSAY COMBINATIONS 

The diagnostic and prognostic assays described herein can be carried out in 
various combinations and can also be carried out in conjunction with other diagnostic or 

25 prognostic tests. For example, when the present methods are used to detect the presence of 
cancer cells in patient sample, the presence of hTRT can be used to determine the stage of the 
disease, whether a particular tumor is likely to invade adjoining tissue or metastasize to a 
distant location, and whether a recurrence of the cancer is likely. Tests that may provide 
additional information include microscopic analysis of biopsy samples, detection of antigens 

30 (e.g., cell-surface markers) associated with tumorigenicity (e.g., using histocytochemistry, 
FACS, or the like), imaging methods (e.g., upon administration to a patient of labeled anti- 
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tumor antibodies), telomerase activity assays, telomere length assays, hTR assays, or the like. 
Such combination tests can provide useful information regarding the progression of a disease. 

It will also be recognized that combinations of assays can provide useful 
information. For example, and as noted above, assays for hTRT mRNA can be combined 
5 with assays for hTR (human telomerase RNA) or telomerase activity (i.e., TRAP) assays to 
provide information about telomerase assembly and function. 

J) KITS 

The present invention also provides kits useful for the screening, monitoring, 
1 0 diagnosis and prognosis of patients with a telomerase-related condition, or for determination 
of the level of expression of hTRT in cells or cell lines. The kits include one or more 
reagents for determining the presence or absence of an hTRT gene product (RNA or protein) 
or for quantifying expression of the hTRT gene. Preferred reagents include nucleic acid 
primers and probes that specifically bind to the hTRT gene, RNA, cDNA, or portions thereof, 
15 along with proteins, peptides, antibodies, and control primers, probes, oligonucleotides, 
proteins, peptides and antibodies. Other materials, including enzymes (e.g., reverse 
transcriptases, DNA polymerases, ligases), buffers, reagents (labels, dNTPs), may be 
included. 

The kits may include alternatively, or in combination with any of the other 
20 components described herein, an antibody that specifically binds to hTRT polypeptides or 
subsequences thereof. The antibody can be monoclonal or polyclonal. The antibody can be 
conjugated to another moiety such as a label and/or it can be immobilized on a solid support 
(substrate). The kit(s) may also contain a second antibody for detection of hTRT 
polypeptide/antibody complexes or for detection of hybridized nucleic acid probes, as well as 
25 one or more hTRT peptides or proteins for use as control or other reagents. 

The antibody or hybridization probe may be free or immobilized on a solid 
support such as a test tube, a microtiter plate, a dipstick and the like. The kit may also 
contain instructional materials teaching the use of the antibody or hybridization probe in an 
assay for the detection of TRT. The kit may contain appropriate reagents for detection of 
30 labels, or for labeling positive and negative controls, washing solutions, dilution buffers and 
the like. 
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In one embodiment, the kit includes a primer pair for amplifying hTRT 
mRNA. Such a kit may also include a probe for hTRT amplified DNA and/or a polymerase, 
buffer, dNTPs, and the like. In another, the kit comprises a probe, optionally a labeled probe. 
In another, the kit comprises an antibody. 

5 

X. IDENTIFICATION OF MODULATORS OF TELOMERASE ACTIVITY 
A. GENERALLY 

The invention provides compounds and treatments that modulate the activity 
1 0 or expression of a telomerase or telomerase component (e.g., hTRT protein). The invention 
also provides assays and screening methods (including high-throughput screens) for 
identification of compounds and treatments that modulate telomerase activity or expression. 
These modulators of telomerase activity and expression (hereinafter referred to as 
"modulators") include telomerase agonists (which increase telomerase activity and/or 
1 5 expression) and telomerase antagonists (which decrease telomerase activity and/or 
expression). 

The modulators of the invention have a wide variety of uses. For example, it 
is contemplated that telomerase modulators will be effective therapeutic agents for treatment 
of human diseases. Screening for agonist activity and transcriptional or translational 

20 activators provides for compositions that increase telomerase activity in a cell (including a 
telomere dependent replicative capacity, or a "partial" telomerase activity). Such agonist 
compositions provide for methods of immortalizing otherwise normal untransformed cells, 
including cells which can express useful proteins. Such agonists can also provide for 
methods of controlling cellular senescence. Conversely, screening for antagonist activity 

25 provides for compositions that decrease telomere dependent replicative capacity, thereby 

mortalizing otherwise immortal cells, such as cancer cells. Screening for antagonist activity 
provides for compositions that decrease telomerase activity, thereby preventing unlimited cell 
division of cells exhibiting unregulated cell growth, such as cancer cells. Illustrative diseases 
and conditions that may be treated using modulators are listed herein, e.g., in Sections VII 

30 and IX, supra. In general, the modulators of the invention can be used whenever it is desired 
to increase or decrease a telomerase activity in a cell or organism. Thus, in addition to use in 
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treatment of disease, a modulator that increases hTRT expression levels can be used to 
produce a cultured human cell line having properties as generally described in Section VIII, 
supra, and various other uses that will be apparent to one of skill. 

A compound or treatment modulates "expression" of telomerase or a 

5 telomerase component when administration of the compound or treatment changes the rate or 
level of transcription of the gene encoding a telomerase component (e.g., the gene encoding 
hTRT mRNA), affects stability or post-transcriptional processing of RNA encoding a 
telomerase component (e.g., transport, splicing, polyadenylation, or other modification), 
affects translation, stability, post-translational processing or modification of an encoded 

10 protein (e.g., hTRT), or otherwise changes the level of functional (e.g., catalytically active) 
telomerase RNP. A compound or treatment affects a telomerase "activity" when 
administration of the compound or treatment changes a telomerase activity such as any 
activity described in Section IV(B), supra (e.g., including processive or non-processive 
telomerase catalytic activity; telomerase processivity; conventional reverse transcriptase 

15 activity; nucleolytic activity; primer or substrate binding activity; dNTP binding activity; 
RNA binding activity; telomerase RNP assembly; and protein binding activity). It will be 
appreciated that there is not necessarily a sharp delineation between changes in "activity" and 
changes in "expression," and that these terms are used for ease of discussion and not for 
limitation. It will also be appreciated that the modulators of the invention should specifically 

20 affect telomerase activity or expression (e.g., without generally changing the expression of 
housekeeping proteins such as actin) rather than, for example, reducing expression of a 
telomerase component by nonspecific poisoning of a target cell. 

B. ASSAYS FOR IDENTIFICATION OF TELOMERASE MODULATORS 

25 The invention provides methods and reagents to screen for compositions or 

compounds capable of affecting expression of a telomerase or telomerase component, capable 
of modifying the DNA replicative capacity of telomerase, or otherwise modifying the ability 
of the telomerase enzyme and TRT protein to synthesize telomeric DNA ("full activity"). 
The invention also provides screens for modulators of any or all of hTRT' s "partial 

30 activities." Thus, the present invention provides assays that can be used to screen for agents 
that increase the activity of telomerase, for example, by causing hTRT protein or telomerase 
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to be expressed in a cell in which it normally is not expressed or by increasing telomerase 
activity levels in telomerase positive cells. 

Telomerase or telomerase subunit proteins or their catalytic or immunogenic 
fragments or oligopeptides thereof, can be used for screening therapeutic compounds in any 
5 of a variety of drug screening techniques. The fragment employed in such a test may be free 
in solution, affixed to a solid support, borne on a cell surface, or located intracellularly. The 
formation of binding complexes, between telomerase or the subunit protein and the agent 
being tested, may be measured. 

In various embodiments, the invention includes methods for screening for 

1 0 antagonists that: bind to the enzyme's active site; inhibit the association of its RNA moiety, 
telomerase-associated proteins, nucleotides, or telomeric DNA to telomerase or hTRT 
protein; promote the disassociation of the enzyme complex; interfere with transcription of the 
telomerase RNA moiety (e.g., hTR); or inhibit any of the "partial activities" described 
herein. The invention provides methods for screening for compositions that inhibit the 

15 association of nucleic acid and/or telomerase-associated compositions with hTRT, such as the 
association of hTR with hTRT or the association of hTRT with the human homologs of p80 
or p95 or another associated protein, or association of hTRT with a telomere or a nucleotide; 
screening for compositions that promote the disassociation or promote the association (i.e., 
assembly) of the enzyme complex, such as an antibody directed to hTR or hTRT; screening 

20 for agents that effect the processivity of the enzyme; and screening for nucleic acids and other 
compositions that bind to telomerase, such as a nucleic acid complementary to hTR. The 
invention further contemplates screening for compositions that increase or decrease the 
transcription of the hTRT gene and/or translation of the hTRT gene product. The invention 
also contemplates a method of screening for telomerase modulators in animals, in one 

25 embodiment, by reconstituting a telomerase activity, or an anti-telomerase activity, in an 
animal, such as a transgenic animal. The invention provides for in vivo assays systems that 
include "knockout" models, in which one or several units of the endogenous telomerase, 
telomerase RNA moiety and/or telomerase-associated proteins have been deleted or inhibited. 
The endogenous telomerase activity, full or partial, can remain or be absent. In one 

30 embodiment, an exogenous telomerase activity, full or partial, is reconstituted. 

In one embodiment of the invention, a variety of partial activity telomerase 
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assays are provided to identify a variety of different classes of modulators of telomerase 
activity. The "partial activity" assays of the invention allow identification of classes of 
telomerase activity modulators that might otherwise not be detected in a "full activity" 
telomerase assay. One partial activity assay involves the non-processive activity of TRT and 

5 telomerase. The processive nature of telomerase is described by Morin (1989) Cell 59:521- 
529; see also Prowse (1993) "Identification of a nonprocessive telomerase activity from 
mouse cells" Proc. Natl Acad. Set USA 90:1493-1497. Another partial activity assay of the 
invention exploits the "reverse-transcriptase-like" activity of telomerase. In these assays, one 
assays the reverse transcriptase activity of the hTRT protein. See Lingner (1997) "Reverse 

10 transcriptase motifs in the catalytic subunit of telomerase" Science 276:561-567. Another 
partial activity assay of the invention exploits the "nucleolytic activity" of hTRT and 
telomerase, involving the enzyme's removing of at least one nucleotide, typically guanosine, 
from the y strand of a primer. This nucleolytic activity has been observed in Tetrahymena 
telomerase by Collins (1993) "Tetrahymena telomerase catalyzes nucleolytic cleavage and 

15 nonprocessive elongation" Genes Dev 7:1364-1376. Another partial activity assay of the 

invention involves analyzing hTRT's and telomerase' s ability to bind nucleotides as part of 
its enzymatic processive DNA polymerization activity. Another partial activity assay of the 
invention involves analyzing hTRT's or telomerase's ability to bind its RNA moiety, Le. 9 
hTR for human cells, used as a template for telomere synthesis. Additional partial activity 

20 assays of the invention involve analyzing hTRT's and telomerase's ability to bind 

chromosomes in vivo, or to bind oligonucleotide primers in vitro or in reconstituted systems, 
or to bind proteins associated with chromosomal structure (see, for an example of such a 
protein, Harrington (1995) J Biol Chem 270: 8893-8901). Chromosomal structures which 
bind hTRT include, for example, telomeric repeat DNA, telomere proteins, histones, nuclear 

25 matrix protein, cell division/ cell cycle control proteins and the like. 

In one embodiment, an assay for identification of modulators comprises 
contacting one or more cells (i.e., "test cells") with a test compound, and determining whether 
the test compound affects expression or activity of a telomerase (or telomerase component) in 
the cell. Usually this determination comprises comparing the activity or expression in the test 

30 cell compared to a similar cell or cells (i.e., control cells) that have not been contacted with 
the test compound. Alternatively, cell extracts may be used in place of intact cells. In a 
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related embodiment, the test compound is administered to a multicellular organism (e.g., a 
plant or animal). The telomerase or telomerase component may be wholly endogenous to the 
cell or multicellular organism (i.e., encoded by naturally occurring endogenous genes), or 
may be a recombinant cell or transgenic organism comprising one or more recombinantly 

5 expressed telomerase components (e.g., hTRT, hTR, telomerase-associated proteins), or may 
have both endogenous and recombinant components. Thus, in one embodiment, telornerase- 
activity-modulators are administered to mortal cells. In another embodiment, telomerase- 
activity-modulators are administered to immortal cells. For example, antagonists of 
telomerase-mediated DNA replication can be identified by administering the putative 

10 inhibitory composition to a cell that is known to exhibit significant amounts of telomerase 
activity, such as cancer cells, and measuring whether a decrease in telomerase activity, 
telomere length, or proliferative capacity is observed, all of which are indicative of a 
compound with antagonist activity. 

In another embodiment, a modulator is identified by monitoring a change in a 

1 5 telomerase activity of a ribonucleoprotein complex (RNP) comprising a TRT (e.g., hTRT) 
and a template RNA (e.g., hTR), which RNP is reconstituted in vitro (e.g., as described in 
Example 7, infra). 

In yet another embodiment, the modulator is identified by monitoring a change 
in expression of a TRT gene product (e.g., RNA or protein) in a cell, animal, in vitro 
20 expression system, or other expression system. 

In still another embodiment, the modulator is identified by changing the 
expression of a reporter gene, such as that described in Example 15, whose expression is 
regulated, in whole or part, by a naturally occurring TRT regulatory element such as a 
promoter or enhancer. In a related embodiment, the ability of a test compound to bind to a 
25 telomerase component (e.g., hTRT), RNA, or gene regulatory sequence (e.g., the TRT gene 
promoter) is assayed. 

In another embodiment, the modulator is identified by observing changes in 
hTRT pre-mRNA processing, for example, alternatively spliced products, alternative 
poly-adenylation events, RNA cleavage, and the like. In a related embodiment the activity of 
3 0 the modulator can be observed by monitoring the production of variant hTRT polypeptides, 
some of which may possess dominant-negative telomerase regulation activity. 
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Assay formats for identification of compounds that affect expression and 
activity of proteins are well known in the biotechnological and pharmaceutical industries, and 
numerous additional assays and variations of the illustrative assays provided supra will be 
apparent to those of skill. 

5 Changes in telomerase activity or expression can be measured by any suitable 

method. Changes in levels of expression of a telomerase component (e.g., hTRT protein) or 
precursor (e.g., hTRT mRNA) can be assayed using methods well known to those of skill, 
some of which are described hereinabove, e.g., in Section IX and including monitoring levels 
of TRT gene products (e.g., protein and RNAs) by hybridization (e.g., using the TRT probes 

10 and primers of the invention), immunoassays (e.g., using the anti-TRT antibodies of the 
invention), RNAse protection assays, amplification assays, or any other suitable detection 
means described herein or known in the art. Quantitating amounts of nucleic acid in a sample 
(e.g., evaluating levels of RNA, e.g., hTR or hTRT mRNA) is also useful in evaluating cis- or 
trans- transcriptional regulators. 

15 Similarly, changes in telomerase activity can be measured using methods such 

as those described herein (e.g., in Section IV(B), supra) or other assays of telomerase 
function. Quantitation of telomerase activity, when desired, may be carried out by any 
method, including those disclosed herein. Telomerase antagonists that can cause or 
accelerate loss of telomeric structure can be identified by monitoring and measuring their 

20 effect on telomerase activity in vivo, ex vivo, or in vitro, or by their effects on telomere length 
(as measured or detected through staining, use of tagged hybridization probes or other means) 
or, simply, by the inhibition of cell division of telomerase positive cancer cells (critical 
shortening of telomeres leads to a phenomenon termed "crisis" or M2 senescence (Shay, 
1991) Biochem. Biophys. Acta 1072:1-7), which cancer cells have bypassed by the activation 

25 of telomerase, but which, in the absence of telomerase, will lead to their senescence or death 
through chromosomal deletion and rearrangement). The in vivo human telomerase activity 
reconstitution provides for a method of screening for telomerase modulators in cells or 
animals from any origin. Such agonists can be identified in an activity assay of the invention, 
including measurements of changes in telomere length. Other examples of assays measuring 

30 telomerase activity in cells include assays for the accumulation or loss of telomere structure, 
the TRAP assay or a quantitative polymerase chain reaction assay. 
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In one embodiment, the assays of the invention also include a method where 
the test compound produces a statistically significant decrease in the activity of hTRT as 
measured by the incorporation of a labeled nucleotide into a substrate compared to the 
relative amount of incorporated label in a parallel reaction lacking the test compound, thereby 

5 determining that the test compound is a telomerase inhibitor. 

The methods of the invention are amenable to adaptations from protocols 
described in the scientific and patent literature and known in the art. For example, when a 
telomerase or TRT protein of this invention is used to identify compositions which act as 
modulators of telomerase activities, large numbers of potentially useful molecules can be 

10 screened in a single test. The modulators can have an inhibitory (antagonist) or potentiating 
(agonist) effect on telomerase activity. For example, if a panel of 1,000 inhibitors is to be 
screened, all 1,000 inhibitors can potentially be placed into one microtiter well and tested 
simultaneously. If such an inhibitor is discovered, then the pool of 1,000 can be subdivided 
into 10 pools of 100 and the process repeated until an individual inhibitor is identified. 

15 In drug screening large numbers of compounds are examined for their ability 

to act as telomerase modulators, a process greatly accelerated by the techniques of hi gh 
throughput screening. The assays for telomerase activity, full or partial, described herein may 
be adapted to be used in a high throughput technique. Those skilled in the art appreciate that 
there are numerous methods for accomplishing this purpose. 

20 Another technique for drug screening which may be applied for high 

throughput screening of compounds having suitable binding affinity to the telomerase or 
telomerase protein subunit is described in detail in "Determination of Amino Acid Sequence 
Antigenicity" by Geysen, (Geysen, WO Application 84/03564, published on September 13, 
1984, incorporated herein by reference). In summary, large numbers of different small 

25 peptide test compounds are synthesized on a solid substrate, such as plastic pins or some 
other surface. The peptide test compounds are reacted with fragments of telomerase or 
telomerase protein subunits and washed. Bound telomerase or telomerase protein subunit is 
then detected by methods well known in the art. Substantially purified telomerase or 
telomerase protein subunit can also be coated directly onto plates for use in the 

30 aforementioned drug screening techniques. Alternatively, non-neutralizing antibodies can be 
used to capture the peptide and immobilize it on a solid support. 
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This invention also contemplates the use of competitive drug screening assays 
in which neutralizing antibodies capable of binding telomerase or subunit protein(s) 
specifically compete with a test compound for binding telomerase or the subunit protein. 
Antibodies can also be used to detect the presence of any peptide which shares one or more 
antigenic determinants with the telomerase or subunit protein. 

Additional methods for identifying modulators of a telomerase activity have 
been described in U.S. Patent No. 5,645,986, which is incorporated herein by reference. It 
will be appreciated that the present invention provides improvements to previously known 
methods, in part by providing reagents such as hTRT polynucleotides, probes and primers, 
highly purified hTR, hTRT and telomerase, as well as anti-telomerase and anti-TRT 
antibodies, all of which may be used in assays, e.g., as controls, standards, binding or 
hybridization agents, or otherwise. 

It will be recognized that the recombinantly produced telomerase and TRT 
(e.g., hTRT) of the invention will be useful in assays for identification of modulators. The 
screening assay can utilize telomerase or hTRT derived by a full or partial reconstitution of 
telomerase activity, or by an augmentation of existing activity. The assay or screens provided 
by the invention can be used to test for the ability of telomerase to synthesize telomeric DNA 
or to test for any one or all or of the "partial activities" of hTRT and TRTs generally, as 
described above. The assay can incorporate ex vivo modification of cells which have been 
manipulated to express telomerase with or without its RNA moiety or associated proteins, 
and these can be re-implanted into an animal, which can be used for in vivo testing. Thus, 
this invention provides in vivo assays and transgenic animals useful therein. These in vivo 
assays systems can employ "knockoufcells, in which one or several units of the endogenous 
telomerase enzyme complex have been deleted or inhibited, as well as cells in which an 
exogenous or endogenous telomerase activity is reconstituted or activated. 

Telomerases and TRT proteins that have been modified in a site-specific 
manner (by site-specific mutation) to modify or delete any or all functions of the telomerase 
enzyme or the TRT protein can also be employed in the screens of the invention to discover 
therapeutic agents. For example, the TRT can be engineered to lose its ability to bind 
substrate DNA, to bind its RNA moiety (as hTR), to catalyze the addition of telomeric DNA, 
to bind deoxynucleotide substrate, to have nucleolytic activity, to bind telomere-associated 
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proteins or chromosomal structures, and the like. The resulting "mutant proteins" or 
"muteens" can be used to identify compounds that specifically modulate one, several, or all 
functions or activities of the TRT protein or telomerase. 

5 C. EXEMPLARY TELOMERASE MODULATORS 

1) GENERALLY 

The test compounds referred to supra may be any of a large variety of 
compounds, both naturally occurring and synthetic, organic and inorganic, and including 
polymers (e.g., oligopeptides, polypeptides, oligonucleotides, and polynucleotides), small 
10 molecules, antibodies (as broadly defined herein), sugars, fatty acids, nucleotides and 

nucleotide analogs, analogs of naturally occurring structures (e.g., peptide mimetics, nucleic 
acid analogs, and the like), and numerous other compounds. 

The invention provides modulators of all types, without limitation to amy 
particular mechanism of action. For illustrative purposes, examples of modulators include 
1 5 compounds or treatments that: 

(i) bind to the hTRT polypeptide (e.g., the active site of the enzyme) or other 
telomerase component, and affect a telomerase activity; 
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(ii) inhibit or promote association, or inhibit or promote disassociation, of a 
telomerase component (e.g., hTRT or the hTRT-hTR RNP) with or from a 
telomerase-associated protein (e.g., including those described in Section IV(D), 
supra); 



(iii) inhibit or promote association, or inhibit or promote disassociation, of 
telomerase polypeptides (e.g., hTRT) with or from a telomerase RNA (e.g., hTR); 
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(iv) inhibit or promote association, or inhibit or promote disassociation, of 
telomerase polypeptides (e.g., hTRT) with or from chromosomes (e.g., telomeres) or 
chromosomal DNA (e.g. telomeric DNA); 
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(v) increase or decrease expression of a telomerase component gene product 
(e.g., products of the hTRT gene), including change the rate or level of transcription 
of the TRT gene, or translation, transport or stability of a gene product, or the like, by 
binding to the gene or gene product (e.g., by interacting with a factor (e.g., a 
transcription regulatory protein) that affects transcription of the hTRT gene or another 
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telomerase component). 

2) PEPTIDE MODULATORS 

Potential modulators of telomerase activity also include peptides (e.g., 
inhibitory (antagonist) and activator (agonist) peptide modulators). For example, 
oligopeptides with randomly generated sequences can be screened to discover peptide 
modulators (agonists or inhibitors) of telomerase activity. Such peptides can be used directly 
as drugs or to find the orientation or position of a functional group that can inhibit telomerase 
activity that, in turn, leads to design and testing of a small molecule inhibitor, or becomes the 
backbone for chemical modifications that increase pharmacological utility. Peptides can be 
structural mimetics, and one can use molecular modeling programs to design mimetics based 
on the characteristic secondary structure and/or tertiary structure of telomerase enzyme and 
hTRT protein. Such structural mimetics can also be used therapeutically, in vivo, as 
modulators of telomerase activity (agonists and antagonists). Structural mimetics can also be 
used as immunogens to elicit anti-telomerase or anti-TRT protein antibodies. 

3) INHIBITORY NATURAL COMPOUNDS AS MODULATORS OF 
TELOMERASE ACTIVITY 

In addition, a large number of potentially useful activity-modifying 
compounds can be screened in extracts from natural products as a source material. Sources of 
such extracts can be from a large number of species of fungi, actinomyces, algae, insects, 
protozoa, plants, and bacteria. Those extracts showing inhibitory activity can then be 
analyzed to isolate the active molecule. See for example, Turner (1996) J. Ethnopharmacol 
51(l-3):39-43; Suh (1995) Anticancer Res. 15:233-239. 

4) INHIBITORY OLIGONUCLEOTIDES 

One particularly useful set of inhibitors provided by the present invention 
includes oligonucleotides which are able to either bind mRNA encoding hTRT protein or to 
the hTRT gene, in either case preventing or inhibiting the production of functional hTRT 
protein. Other oligonucleotides of the invention interact with telomerase's RNA moiety, such 
as hTR, or are able to prevent binding of telomerase or hTRT to its DNA target, or one 
telomerase component to another, or to a substrate. Such oligonucleotides can also bind the 
telomerase enzyme, hTRT protein, or both protein and RNA and inhibit a partial activity as 
described above (such as its processive activity, its reverse transcriptase activity, its 
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nucleolytic activity, and the like). The association can be through sequence specific 
hybridization to another nucleic acid or by general binding, as in an aptamer, or both. 

Telomerase activity can be inhibited by targeting the hTRT mRNA with 
antisense oligonucleotides capable of binding the hTRT mRNA. 
5 Another useful class of inhibitors includes oligonucleotides which cause 

inactivation or cleavage of hTRT mRNA or hTR. That is, the oligonucleotide is chemically 
modified, or has enzyme activity, which causes such cleavage, such as is the case for a 
ribozyme, an EDTA-tethered oligonucleotide, or a covalently bound oligonucleotide, such as 
a psoralen or other cross-linking reagent bound oligonucleotide. As noted above, one may 

10 screen a pool of many different such oligonucleotides for those with the desired activity. 

Another useful class of inhibitors includes oligonucleotides which bind 
polypeptides. Double- or single-stranded DNA or double- or single-stranded RNA molecules 
that bind to specific polypeptides targets are called "aptamers " The specific oligonucleotide- 
polypeptide association may be mediated by electrostatic interactions. For example, aptamers 

1 5 specifically bind to anion-binding exosites on thrombin, which physiologically binds to the 
polyanionic heparin (Bock (1992) Nature 355:564-566). Because hTRT protein binds both 
hTR and its DNA substrate, and because the present invention provides hTRT and other TRT 
proteins in purified form in large quantities, those of skill in the art can readily screen for 
TRT-binding aptamers using the methods of the invention. 

20 Oligonucleotides (e.g., RNA oligonucleotides) that bind telomerase, hTRT, 

hTR, or portions thereof, can be generated using the techniques of SELEX (Tuerk, 1997, 
Methods Mol Biol 67, 2190). In this technique a very large pool (106-109) of random 
sequence nucleic acids is bound to the target (e.g. hTRT) using conditions that cause a large 
amount of discrimination between molecules with high affinity and low affinity for binding 

25 the target. The bound molecules are separated from unbound, and the bound molecules are 
amplified by virtue of a specific nucleic acid sequence included at their termini and suitable 
amplification reagents. This process is reiterated several times until a relatively small number 
of molecules remain that possess high binding affinity for the target. These molecules can 
then be tested for their ability to modulate telomerase activity as described herein. 

30 Antagonists of telomerase-mediated DNA replication can also be based on 

inhibition of hTR (Norton (1996) Nature Biotechnology 14:615-619) through complementary 
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sequence recognition or cleavage, as through ribozymes. 

The inhibitory oligonucleotides of the invention can be transferred into the cell 
using a variety of techniques well known in the art. For example, oligonucleotides can be 
delivered into the cytoplasm without specific modification. Alternatively, they can be 
delivered by the use of liposomes which fuse with the cellular membrane or are endocytosed, 
i.e., by employing ligands attached to the liposome or directly to the oligonucleotide, that 
bind to surface membrane protein receptors of the cell resulting in endocytosis. 
Alternatively, the cells may be permeabilized to enhance transport of the oligonucleotides 
into the cell, without injuring the host cells. One can use a DNA binding protein, e.g., 
HBGF-1 , known to transport an oligonucleotide into a cell. 

5) INHIBITORY RIBOZYMES 

Ribozymes act by binding to a target RNA through the target RNA binding 
portion of a ribozyme which is held in close proximity to an enzymatic portion of the 
ribozyme that cleaves the target RNA. Thus, the ribozyme recognizes and binds a target RNA 
usually through complementary base-pairing, and once bound to the correct site, acts 
enzymatically to cleave and inactivate the target RNA. Cleavage of a target RNA in such a 
manner will destroy its ability to direct synthesis of an encoded protein if the cleavage occurs 
in the coding sequence. After a ribozyme has bound and cleaved its RNA target, it is typically 
released from that RNA and so can bind and cleave new targets repeatedly. 

6) IDENTIFYING TELOMERASE-ASSOCIATED PROTEINS FOR 

USE AS MODULATORS 

In one embodiment of the invention, telomerase is used to identify telomerase- 
associated proteins, i.e., telomerase accessory proteins which modulate or otherwise 
complement telomerase activity. As noted above, these proteins or fragments thereof can 
modulate function by causing the dissociation or preventing the association of the telomerase 
enzyme complex, preventing the assembly of the telomerase complex, preventing hTRT from 
binding to its nucleic acid complement or to its DNA template, preventing hTRT from 
binding nucleotides, or preventing, augmenting, or inhibiting any one, several or all of the 
partial activities of the telomerase enzyme or hTRT protein, as described above. 

One of skill in the art can use the methods of the invention to identify which 
portions (e.g., domains) of these telomerase-associating proteins contact telomerase. In one 
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embodiment of the invention, these telomerase-associating proteins or fragments thereof are 
used as modulators of telomerase activity. 

7) TELOMERASE- ASSOCIATED PROTEINS AS DOMINANT 
NEGATIVE MUTANTS 

5 In one embodiment of the invention, telomerase-associated proteins are used 

as modulators of telomerase activity. Telomerase-associated proteins include chromosomal 
structures, such as histones, nuclear matrix proteins, cell division and cell cycle control 
proteins, and the like. Other telomerase-associated proteins which can be used as modulators 
for the purpose of the invention include the p80 and p95 proteins and their human homologs, 
10 such as TP 1 and TRF-1 (Chong , 1995, Science 270:1663-1667). In addition, fragments of 
these telomerase-associated proteins can be identified by the skilled artisan in accordance 
with the methods of the invention and used as modulators of telomerase activity. 

8) DOMINANT NEGATIVE MUTANTS 

Eight highly conserved motifs have been identified between TRTs of different 
15 non-human species, as described above (see also Lingner (1997) Science 276:561-567). 

Figure 4 shows a schematic of the human TRT amino acid sequence (from pGRN121) and 
RT motifs as compared to S. pombe Trtlp, Euplotes pl23 and S. cerevisiae Est2 p. The 
present invention provides recombinant and synthetic nucleic acids in which the codons for 
the conserved amino acid residues in each, alone or in conjunction with one or more 
20 additional codons, of all eight of these motifs has been a changed to each of the other codons. 
A variety of the resulting coding sequences express a non-functional hTRT. See, for 
instance, Example 16, Thus, the present invention provides, for example, a wide variety of 
"mutated" telomerase enzymes and TRT proteins which have a partial activity but not full 
activity of telomerase. For example, one such telomerase is able to bind telomeric structures, 
25 but not bind telomerase-associated RNA {i.e., hTR). If expressed at high enough levels, such 
a telomerase mutant can deplete a necessary telomerase component (e.g., hTR) and thereby 
function as an inhibitor of wild-type telomerase activity. A mutated telomerase acting in this 
manner is an antagonist or a so-called "dominant-negative" mutant. 

9) ANTIBODIES 

30 In general, the antibodies of the invention can be used to identify, purify, or 

inhibit any or all activity of telomerase enzyme and hTRT protein. Antibodies can act as 
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antagonists of telomerase activity in a variety of ways, for example, by preventing the 
telomerase complex or nucleotide from binding to its DNA substrates, by preventing the 
components of telomerase from forming an active complex, by maintaining a functional 
(telomerase complex) quaternary structure or by binding to one of the enzyme's active sites or 
5 other sites that have allosteric effects on activity (the different partial activities of telomerase 
are described in detail elsewhere in this specification). 

D) MODULATOR SYNTHESIS 

It is contemplated that the telomerase modulators of the invention will be 
10 made using methods well known in the pharmaceutical arts, including combinatorial methods 
and rational drug design techniques. 

1) COMBINATORIAL CHEMISTRY METHODOLOGY 

The creation and simultaneous screening of large libraries of synthetic 
molecules can be carried out using well-known techniques in combinatorial chemistry, for 
1 5 example, see van Breemen (1997) Anal Chem 69:2 1 59-2 1 64; Lam ( 1 997) Anticancer Drug 
Des 12:145-167 (1997). 

As noted above, combinatorial chemistry methodology can be used to create 
vast numbers of oligonucleotides (or other compounds) that can be rapidly screened for 
specific oligonucleotides (or compounds) that have appropriate binding affinities and 
20 specificities toward any target, such as the TRT proteins of the invention, can be utilized (for 
general background information Gold (1995) J. of Biol Chem. 270:13581-13584). 

2) RATIONAL DRUG DESIGN 

Rational drug design involves an integrated set of methodologies that include 
structural analysis of target molecules, synthetic chemistries, and advanced computational 

25 tools. When used to design modulators, such as antagonists/inhibitors of protein targets, such 
as telomerase enzyme and hTRT protein, the objective of rational drug design is to 
understand a molecule's three-dimensional shape and chemistry. Rational drug design is 
aided by X-ray cry stallographic data or NMR data, which can now be determined for the 
hTRT protein and telomerase enzyme in accordance with the methods and using the reagents 

30 provided by the invention. Calculations on electrostatics, hydrophobicities and solvent 

accessibility is also helpful. See, for example, Coldren (1997) Proc Natl Acad. Sal USA 
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94:6635-6640. 
E) KITS 

5 The invention also provides kits that can be used to aid in determining whether 

a test compound is a modulator of a TRT activity. The kit will typically include one or more 
of the following components: a substantially purified TRT polypeptide or polynucleotide 
(including probes and primers); a plasmid capable of expressing a TRT (e.g., hTRT) when 
introduced into a cell or cell-free expression system; a plasmid capable of expressing a TR 

10 (e.g., hTR) when introduced into a cell or cell-free expression system; cells or cell lines; a 
composition to detect a change in TRT activity; and, an instructional material teaching a 
means to detect and measure a change in the TRT activity, indicating that a change in the 
telomerase activity in the presence of the test compound is an indicator that the test 
compound modulates the telomerase activity, and one or more containers. The kit can also 

1 5 include means, such as TRAP assay reagents or reagents for a quantitative polymerase chain 
reaction assay, to measure a change in TRT activity. The kit may also include instructional 
material teaching a means to detect and measure a change in the TRT activity, indicating that 
a change in the telomerase activity in the presence of the test compound is an indicator that 
the test compound modulates the telomerase activity. 

20 

XL TRANSGENIC ORGANISMS (TELOMERASE KNOCKOUT CELLS AND 
ANIMAL MODELS) 

The invention also provides transgenic non-human multicellular organisms 
(e.g., plants and non-human animals) or unicellular organisms (e.g., yeast) comprising an 
25 exogenous TRT gene sequence, which may be a coding sequence or a regulatory (e.g., 
promoter) sequence. In one embodiment, the organism expresses an exogenous TRT 
polypeptide, having a sequence of a human TRT protein. In a related embodiment, the 
organism also expresses a telomerase RNA component (e.g., hTR). 

The invention also provides unicellular and multicellular organisms (or cells 
30 therefrom) in which at least one gene encoding a telomerase component (e.g., TRT or TR) or 
telomerase-associated protein is mutated or deleted (i.e., in a coding or regulatory region) 
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such that native telomerase is not expressed, or is expressed at reduced levels or with 
different activities when compared to wild-type cells or organisms. Such cells and organisms 
are often referred to as "gene knock-out" cells or organisms. 

The invention further provides cells and organisms in which an endogenous 
telomerase gene (e.g., murine TRT) is either present or optionally mutated or deleted and an 
exogenous telomerase gene or variant (e.g., human TRT) is introduced and expressed. Cells 
and organisms of this type will be useful, for example, as model systems for identifying 
modulators of hTRT activity or expression; determining the effects of mutations in 
telomerase component genes, and other uses such as determining the developmental timing 
and tissue location of telomerase activity (e.g., for assessing when to administer a telomerase 
modulator and for assessing any potential side effects). 

Examples of multicellular organisms include plants, insects, and nonhuman 
animals such as mice, rats, rabbits, monkeys, apes, pigs, and other nonhuman mammals. An 
example of a unicellular organism is a yeast. 

Methods for alteration or disruption of specific genes (e.g., endogenous TRT 
genes) are well known to those of skill, see, e.g., Baudin et al., 1993, Nucl. Acids Res. 
21:3329; Wachef al, 1994, Yeast 10:1793; Rothstein, 1991, Methods Enzymol 194:281; 
Anderson, 1995, Methods Cell Biol. 48:31; Pettitt et al., 1996, Development 122:4149-4157; 
Ramirez-Solis et al., 1993, Methods Enzymol. 225:855; and Thomas et al., 1987, Cell 51:503, 
each of which is incorporated herein by reference in its entirety for all purposes. 

The "knockout" cells and animals of the invention include cells and animals in 
which one or several units of the endogenous telomerase enzyme complex have been deleted 
or inhibited. Reconstitution of telomerase activity will save the cell or animal from 
senescence or, for cancer cells, cell death caused by its inability to maintain telomeres. 
Methods of altering the expression of endogenous genes are well known to those of skill in 
the art. Typically, such methods involve altering or replacing all or a portion of the 
regulatory sequences controlling expression of the particular gene to be regulated. The 
regulatory sequences, e.g., the native promoter can be altered. The conventional technique 
for targeted mutation of genes involves placing a genomic DNA fragment containing the gene 
of interest into a vector, followed by cloning of the two genomic arms associated with the 
targeted gene around a selectable neomycin-resistance cassette in a vector containing 
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thymidine kinase. This "knock-out" construct is then transfected into the appropriate host 
cell, i.e., a mouse embryonic stem (ES) cell, which is subsequently subjected to positive 
selection (using G418, for example, to select for neomycin-resistance) and negative selection 
(using, for example, FIAU to exclude cells lacking thymidine kinase), allowing the selection 
5 of cells which have undergone homologous recombination with the knockout vector. This 
approach leads to inactivation of the gene of interest. See, e.g., U.S. patents 5,464,764; 
5,631,153; 5,487,992; and, 5,627,059. 

"Knocking out" expression of an endogenous gene can also be accomplished 
by the use of homologous recombination to introduce a heterologous nucleic acid into the 

10 regulatory sequences (e.g., promoter) of the gene of interest. To prevent expression of 

functional enzyme or product, simple mutations that either alter the reading frame or disrupt 
the promoter can be suitable. To up-regulate expression, a native promoter can be substituted 
with a heterologous promoter that induces higher levels of transcription. Also, "gene trap 
insertion" can be used to disrupt a host gene, and mouse ES cells can be used to produce 

1 5 knockout transgenic animals, as described for example, in Holzschu (1997) Transgenic Res 6: 
97-106. 

Altering the expression of endogenous genes by homologous recombination 
can also be accomplished by using nucleic acid sequences comprising the structural gene in 
question. Upstream sequences are utilized for targeting heterologous recombination 

20 constructs. Utilizing TRT structural gene sequence information, such as SEQUENCE ID 
NO: 1, one of skill in the art can create homologous recombination constructs with only 
routine experimentation. Homologous recombination to alter expression of endogenous 
genes is described in U.S. Patent 5,272,071, and WO 91/09955, WO 93/09222, WO 
96/2941 1 , WO 95/3 1 560, and WO 91/12650. Homologous recombination in mycobacteria is 

25 described by Azad (1996) Proc. Natl. Acad ScL USA 93:4787; Baulard (1996) J. 

Bacteriol 178:3091; and Pelicic (1996) MoL Microbiol 20:919. Homologous recombination 
in animals has been described by Moynahan (1996) Hum. MoL Genet. 5:875, and in plants by 
Offringa (1990) EMBOJ. 9:3077. 

30 XII. GLOSSARY 

The following terms are defined infra to provide additional guidance to one of 

177 



rim mil i 1 




skill in the practice of the invention: adjuvant, allele (& allelic sequence), amino acids 
(including hydrophobic, polar, charged), conservative substitution, control elements (& 
regulatory sequences), derivatized, detectable label, elevated level, epitope, favorable and 
unfavorable prognosis, fusion protein, gene product, hTR, immortal, immunogen and 
5 immunogenic, isolated, modulator, motif, nucleic acid (& polynucleotide), oligonucleotides 
(& oligomers), operably linked, polypeptide, probe (including nucleic acid probes & antibody 
probes), recombinant, selection system, sequence, specific binding, stringent hybridization 
conditions (& stringency), substantial identity (& substantial similarity), substantially pure (& 
substantially purified), telomerase-negative and telomerase-positive cells, telomerase 
1 0 catalytic activity, telomerase-related, and test compound. 



substance that enhances the immune response to an antigen with which it is mixed. 
Adjuvants useful in the present invention include, but are not limited to, Freund's, mineral 
15 gels such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic 
polyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanin, and dinitrophenol. 
BCG (Bacillus Calmette-Guerin) and Corynebacterium parvum are potentially useful 
adjuvants. 

20 As used herein, the terms "allele 11 or "allelic sequence" refer to an alternative 

form of a nucleic acid sequence (i.e., a nucleic acid encoding hTRT protein). Alleles result 
from mutations (i.e., changes in the nucleic acid sequence), and generally produce altered 
and/or differently regulated mRNAs or polypeptides whose structure and/or function may or 
may not be altered. Common mutational changes that give rise to alleles are generally 

25 ascribed to natural deletions, additions, or substitutions of nucleotides that may or may not 
affect the encoded amino acids. Each of these types of changes may occur alone, in 
combination with the others, or one or more times within a given gene, chromosome or other 
cellular nucleic acid. Any given gene may have no, one or many allelic forms. As used 
herein, the term "allele" refers to either or both a gene or an mRNA transcribed from the 



As used herein, the term "adjuvant" refers to its ordinary meaning of any 
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As used herein, "amino acids" are sometimes specified using the standard one 
letter code: Alanine (A), Serine (S), Threonine (T), Aspartic acid (D), Glutamic acid (E) 
Asparagine (N), Glutamine (Q), Arginine (R) s Lysine (K), Isoleucine (I), Leucine (L), 
Methionine (M), Valine (V), Phenylalanine (F), Tyrosine (Y)> Tryptophan (W), Proline (P), 
5 Glycine (G), Histidine (H), Cysteine (C). Synthetic and non-naturally occurring amino acid 
analogues (and/or peptide linkages) are included. 

As used herein, "Hydrophobic amino acids" refers to A, L, I, V, P, F, W, and 
M. As used herein, "polar amino acids" refers to G, S, T, Y, C, N, and Q. As used herein, 
10 "charged amino acids" refers to D, E, H, K, and R. 

As used herein,"conservative substitution", when describing a protein refers 
to a change in the amino acid composition of the protein that does not substantially alter the 
protein's activity. Thus, "conservatively modified variations" of a particular amino acid 

15 sequence refers to amino acid substitutions of those amino acids that are not critical for 
protein activity or substitution of amino acids with other amino acids having similar 
properties {e.g., acidic, basic, positively or negatively charged, polar or non-polar, etc.) such 
that the substitutions of even critical amino acids does not substantially alter activity. 
Conservative substitution tables providing functionally similar amino acids are well known in 

20 the art. The following six groups each contain amino acids that are conservative substitutions 
for one another: 1) Alanine (A), Serine (S), Threonine (T); 2) Aspartic acid (D), Glutamic 
acid (E); 3) Asparagine (N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) Isoleucine (I), 
Leucine (L), Methionine (M), Valine (V); and 6) Phenylalanine (F), Tyrosine (Y), 
Tryptophan (W) (see also, Creighton (1984) Proteins, W.H. Freeman and Company). One of 

25 skill in the art will appreciate that the above-identified substitutions are not the only possible 
conservative substitutions. For example, one may regard all charged amino acids as 
conservative substitutions for each other whether they are positive or negative. In addition, 
individual substitutions, deletions or additions which alter, add or delete a single amino acid 
or a small percentage of amino acids in an encoded sequence can also be "conservatively 

30 modified variations". One can also make a "conservative substitution" in a recombinant 

protein by utilizing one or more codons that differ from the codons employed by the native or 
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wild-type gene. In this instance, a conservative substitution also includes substituting a 
codon for an amino acid with a different codon for the same amino acid. 

As used herein,"control elements" or "regulatory sequences" include 
5 enhancers, promoters, transcription terminators, origins of replication, chromosomal 
integration sequences, 5 1 and 3' untranslated regions, with which proteins or other 
biomolecules interact to carry out transcription and translation. For eukaryotic cells, the 
control sequences will include a promoter and preferably an enhancer, e.g., derived from 
immunoglobulin genes, SV40, cytomegalovirus, and a polyadenylation sequence, and may 
10 include splice donor and acceptor sequences. Depending on the vector system and host 

utilized, any number of suitable transcription and translation elements, including constitutive 
and inducible promoters, may be used 

As used herein, a "derivatized" polynucleotide, oligonucleotide, or nucleic 

15 acid refers to oligo- and polynucleotides that comprise a derivatized substituent. In some 

embodiments, the substituent is substantially non-interfering with respect to hybridization to 
complementary polynucleotides. Derivatized oligo- or polynucleotides that have been 
modified with appended chemical substituents (e.g., by modification of an already 
synthesized oligo- or poly-nucleotide, or by incorporation of a modified base or backbone 

20 analog during synthesis) may be introduced into a metabolically active eukaryotic cell to 

hybridize with an hTRT DNA, RNA, or protein where they produce an alteration or chemical 
modification to a local DNA, RNA, or protein. Alternatively, the derivatized oligo or 
polynucleotides may interact with and alter hTRT polypeptides, telomerase-associated 
proteins, or other factors that interact with hTRT DNA or hTRT gene products, or alter or 

25 modulate expression or function of hTRT DNA, RNA or protein. Illustrative attached 
chemical substituents include: europium (III) texaphyrin, cross-linking agents, psoralen, 
metal chelates (e.g., iron/EDTA chelate for iron catalyzed cleavage), topoisomerases, 
endonucleases, exonucleases, ligases, phosphodiesterases, photodynamic porphyrins, 
chemotherapeutic drugs (e.g., adriamycin, doxirubicin), intercalating agents, base- 

30 modification agents, immunoglobulin chains, and oligonucleotides. Iron/EDTA chelates are 
chemical substituents often used where local cleavage of a polynucleotide sequence is desired 
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(Hertzberg et al., 1982, X Am. Chem. Soc. 104: 313; Hertzberg and Dervan, 1984, 
Biochemistry 23: 3934; Taylor et al., 1984, Tetrahedron 40: 457; Dervan, 1986, Science 232: 
464. Illustrative attachment chemistries include: direct linkage, e.g., via an appended reactive 
amino group (Corey and Schultz (1988) Science 238: 1401, which is incorporated herein by 
reference) and other direct linkage chemistries, although streptavidin/biotin and 
digoxigenin/anti-digoxigenin antibody linkage methods can also be used. Methods for 
linking chemical substituents are provided in U.S. Patents 5,135,720, 5,093,245, and 
5,055,556, which are incorporated herein by reference. Other linkage chemistries may be 
used at the discretion of the practitioner. 

As used herein, a "detectable label" has the ordinary meaning in the art and 
refers to an atom (e.g., radionuclide), molecule (e.g., fluorescein), or complex, that is or can 
be used to detect (e.g., due to a physical or chemical property), indicate the presence of a 
molecule or to enable binding of another molecule to which it is covalently bound or 
otherwise associated. The term "label" also refers to covalently bound or otherwise 
associated molecules (e.g., a biomolecule such as an enzyme) that act on a substrate to 
produce a detectable atom, molecule or complex. Detectable labels suitable for use in the 
present invention include any composition detectable by spectroscopic, photochemical, 
biochemical, immunochemical, electrical, optical or chemical means. Labels useful in the 
present invention include biotin for staining with labeled streptavidin conjugate, magnetic 
beads (e.g., Dynabeads™), fluorescent dyes (e.g., fluorescein, Texas red, rhodamine, green 
fluorescent protein, enhanced green fluorescent protein, lissamine, phycoerythrin, Cy2, Cy3, 
Cy3.5, Cy5, Cy5.5, Cy7, FluorX [Amersham], SyBR Green I & II [Molecular Probes], and 
the like), radiolabels (e.g., 3 H, 125 1, 35 S, 14 C, or 32 P), enzymes ( e.g., hydrolases, particularly 
phosphatases such as alkaline phosphatase, esterases and glycosidases, or oxidoreductases, 
particularly peroxidases such as horse radish peroxidase, and others commonly used in 
ELISAs), substrates, cofactors, inhibitors, chemiluminescent groups, chromogenic agents, 
and colorimetric labels such as colloidal gold or colored glass or plastic (e.g., polystyrene, 
polypropylene, latex, etc.) beads. Patents teaching the use of such labels include U.S. Patent 
Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 4,275,149; and 4,366,241. 
Means of detecting such labels are well known to those of skill in the art. Thus, for example, 
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radiolabels and chemiluminescent labels may be detected using photographic film or 
scintillation counters, fluorescent markers may be detected using a photodetector to detect 
emitted light (e.g., as in fluorescence-activated cell sorting). Enzymatic labels are typically 
detected by providing the enzyme with a substrate and detecting the reaction product 

5 produced by the action of the enzyme on the substrate, and colorimetric labels are detected by 
simply visualizing the colored label. Thus, a label is any composition detectable by 
spectroscopic, photochemical, biochemical, immunochemical, electrical, optical or chemical 
means. The label may be coupled directly or indirectly to the desired component of the assay 
according to methods well known in the art. Non-radioactive labels are often attached by 

1 0 indirect means. Generally, a ligand molecule (e.g., biotin) is covalently bound to the 

molecule. The ligand then binds to an anti-ligand (e.g., streptavidin) molecule which is either 
inherently detectable or covalently bound to a signal generating system, such as a detectable 
enzyme, a fluorescent compound, or a chemiluminescent compound. A number of ligands 
and anti-ligands can be used. Where a ligand has a natural anti-ligand, for example, biotin, 

15 thyroxine, and Cortisol, it can be used in conjunction with the labeled, naturally occurring 

anti-ligands. Alternatively, any haptenic or antigenic compound can be used in combination 
with an antibody. The molecules can also be conjugated directly to signal generating 
compounds, e.g., by conjugation with an enzyme or fluorophore. Means of detecting labels 
are well known to those of skill in the art. Thus, for example, where the label is a radioactive 

20 label, means for detection include a scintillation counter, photographic film as in 

autoradiography, or storage phosphor imaging. Where the label is a fluorescent label, it may 
be detected by exciting the fluorochrome with the appropriate wavelength of light and 
detecting the resulting fluorescence. The fluorescence may be detected visually, by means of 
photographic film, by the use of electronic detectors such as charge coupled devices (CCDs) 

25 or photomultipliers and the like. Similarly, enzymatic labels may be detected by providing 
the appropriate substrates for the enzyme and detecting the resulting reaction product Also, 
simple colorimetric labels may be detected by observing the color associated with the label. 
It will be appreciated that when pairs of fluorophores are used in an assay, it is often preferred 
that the they have distinct emission patterns (wavelengths) so that they can be easily 

30 distinguished. 



182 



The phrase "elevated level" refers to an amount of hTRT gene product (or 
other specified substance or activity) in a cell that is elevated or higher than the level in a 
reference standard, e.g., for diagnosis, the level in normal, telomerase-negative cells in an 
individual or in other individuals not suffering from the condition, and for prognosis, the 
5 level in tumor cells from a variety of grades or classes of, e.g., tumors. 

As used herein, the term "epitope" has its ordinary meaning of a site on an 
antigen recognized by an antibody. Epitopes are typically segments of amino acids which are 
a small portion of the whole protein. Epitopes may be conformational (/.e., discontinuous). 
1 0 That is, they may be formed from amino acids encoded by noncontiguous parts of a primary 
sequence that have been juxtaposed by protein folding. 

The terms "favorable prognosis" and "unfavorable prognosis" are known in 
the art. In the context of cancers, "favorable prognosis" means that there is a likelihood of 
1 5 tumor regression or longer survival times for patients with a favorable prognosis relative to 
those with unfavorable prognosis, whereas "unfavorable prognosis" means that the tumor is 
likely to be more aggressive, i.e., grow faster and/or metastasize, resulting in a poor outcome 
or a more rapid course of disease progression for the patient. 

20 As used herein, the term "fusion protein," refers to a composite protein, i.e., a 

single contiguous amino acid sequence, made up of two (or more) distinct, heterologous 
polypeptides which are not normally fused together in a single amino acid sequence. Thus, a 
fusion protein may include a single amino acid sequence that contains two entirely distinct 
amino acid sequences or two similar or identical polypeptide sequences, provided that these 

25 sequences are not normally found together in the same configuration in a single amino acid 
sequence found in nature. Fusion proteins may generally be prepared using either 
recombinant nucleic acid methods, i.e., as a result of transcription and translation of a 
recombinant gene fusion product, which fusion comprises a segment encoding a polypeptide 
of the invention and a segment encoding a heterologous protein, or by chemical synthesis 

30 methods well known in the art. The non-hTRT region(s) of the fusion protein can be fused to 
the amino terminus of the hTRT polypeptide or the carboxyl terminus, or both or the non- 
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hTRT region can be inserted into the interior of the protein sequence (by moiety inserting or 
by replacing amino acids) or combinations of the foregoing can be performed. 

As used herein, the term "gene product" refers to an RNA molecule 
transcribed from a gene, or a protein encoded by the gene or translated from the RNA. 

As used herein, "hTR" (human telomerase RNA) refers to the RNA 
component of human telomerase and any naturally occurring alleles and variants or 
recombinant variants. hTR is described in detail in U.S. Patent No. 5,583,016 which is 
incorporated herein by reference in its entirety and for all purposes. 

As used herein, the term "immortal," when referring to a cell, has its normal 
meaning in the telomerase art and refers to cells that have apparently unlimited replicative 
potential. Immortal can also refer to cells with increased proliferative capacity relative to 
their unmodified counterparts. Examples of immortal human cells are malignant tunior cells, 
germ line cells, and certain transformed human cell lines cultured in vitro (e.g., cells that have 
become immortal following transformation by viral oncogenes or otherwise). In contrast, 
most normal human somatic cells are mortal, i.e., have limited replicative potential and 
become senescent after a finite number of cell divisions. 

As used herein, the terms "immunogen" and "immunogenic" have their 
ordinary meaning in the art, Le, an immunogen is a molecule, such as a protein or other 
antigen, that can elicit an adaptive immune response upon injection into a person or an 
animal. 

As used herein, "isolated," when referring to a molecule or composition, such 
as, for example, an RNP (e.g., at least one protein and at least one RNA), means that the 
molecule or composition is separated from at least one other compound, such as a protein, 
other RNAs, or other contaminants with which it is associated in vivo or in its naturally 
occurring state. Thus, an RNP is considered isolated when the RNP has been isolated from 
any other component with which it is naturally associated, e.g., cell membrane, as in a cell 
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extract. An isolated composition can, however, also be substantially pure. 

As used herein, "modulator" refers to any synthetic or natural compound or 
composition that can change in any way either or both the "full" or any "partial activity" of a 
telomerase reverse transcriptase (TRT). A modulator can be an agonist or an antagonist. A 
modulator can be any organic and inorganic compound; including, but not limited to, for 
example, small molecules, peptides, proteins, sugars, nucleic acids, fatty acids and the like. 

As used herein," motif 1 refers to a sequence of contiguous amino acids (or to a 
nucleic acid sequence that encodes a sequence of contiguous amino acids) that defines a 
feature or structure in a protein that is common to or conserved in all proteins of a defined 
class or type. The motif or consensus sequence may include both conserved and 
non-conserved residues. The conserved residues in the motif sequence indicate that the 
conserved residue or class (i.e., hydrophobic, polar, non-polar, or other class) of residues is 
typically present at the indicated location in each protein (or gene or mRNA) of the class of 
proteins defined by the motif. Motifs can differ in accordance with the class of proteins. 
Thus, for example, the reverse transcriptase enzymes form a class of proteins than can be 
defined by one or more motifs, and this class includes telomerase enzymes. However, the 
telomerase enzymes can also be defined as the class of enzymes with motifs characteristic for 
that class. Those of skill recognize that the identification of a residue as a conserved residue 
in a motif does not mean that every member of the class defined by the motif has the 
indicated residue (or class of residues) at the indicated position, and that one or more 
members of the class may have a different residue at the conserved position. 

As used herein, the terms "nucleic acid" and "polynucleotide" are used 
interchangeably. Use of the term "polynucleotide" is not intended to exclude 
oligonucleotides (i.e., short polynucleotides) and can also refer to synthetic and/or non- 
naturally occurring nucleic acids (i.e., comprising nucleic acid analogues or modified 
backbone residues or linkages). 

As used herein "oligonucleotides" or "oligomers" refer to a nucleic acid 
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sequence of approximately 7 nucleotides or greater, and as many as approximately 100 
nucleotides, which can be used as a primer, probe or amplimer. Oligonucleotides are often 
between about 10 and about 50 nucleotides in length, more often between about 14 and about 
35 nucleotides, very often between about 15 and about 25 nucleotides, and the terms 
oligonucleotides or oligomers can also refer to synthetic and/or non-naturally occurring 
nucleic acids (i.e., comprising nucleic acid analogues or modified backbone residues or 
linkages). 

As used herein, the term "operably linked," refers to a functional relationship 
between two or more nucleic acid (e.g., DNA) segments: for example, a promoter or enhancer 
is operably linked to a coding sequence if it stimulates the transcription of the sequence in an 
appropriate host cell or other expression system. Generally, sequences that are operably 
linked are contiguous, and in the case of a signal sequence both contiguous and in reading 
phase. However, enhancers need not be located in close proximity to the coding sequences 
whose transcription they enhance. 

As used herein, the term "polypeptide" is used interchangeably herein with 
the term "protein," and refers to a polymer composed of amino acid residues linked by amide 
linkages, including synthetic, naturally-occurring and non-naturally occurring analogs thereof 
(amino acids and linkages). Peptides are examples of polypeptides. 

As used herein, a "probe" refers to a molecule that specifically binds another 
molecule. One example of a probe is a "nucleic acid probe" that specifically binds (i.e., 
anneals or hybridizes) to a substantially complementary nucleic acid. Another example of a 
probe is an "antibody probe" that specifically binds to a corresponding antigen or epitope. 

As used herein, "recombinant" refers to a polynucleotide synthesized or 
otherwise manipulated in vitro (e.g., "recombinant polynucleotide"), to methods of using 
recombinant polynucleotides to produce gene products in cells or other biological systems, or 
to a polypeptide ("recombinant protein") encoded by a recombinant polynucleotide. 
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As used herein, a "selection system," in the context of stably transformed cell 
lines, refers to a method for identifying and/or selecting cells containing a recombinant 
nucleic acid of interest. A large variety of selection systems are known for identification of 
transformed cells and are suitable for use with the present invention. For example, cells 
transformed by plasmids or other vectors can be selected by resistance to antibiotics conferred 
by genes contained on the plasmids, such as the well known amp, gpt, neo and hyg genes, or 
other genes such as the herpes simplex virus thymidine kinase (Wigler et al., Cell 1 1 ::223-32 
[1977]) and adenine phosphoribosyltransferase (Lowy et al., Cell 22:817 [1980]) genes which 
can be employed in tk- or aprt- cells, respectively. Also, antimetabolite, antibiotic or 
herbicide resistance can be used as the basis for selection; for example, dhfr which confers 
resistance to methotrexate and is also useful for gene amplification (Wigler et al., Proc. Natl. 
Acad. Scl, 77:3567 [1980]); npt, which confers resistance to the aminoglycosides neomycin 
and G-418 (Colbere-Garapin et al., J. Mol. Biol, 150:1 [1981]) and ah or pat, which confer 
resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively (Murry, in 
McGraw Hill Yearbook of Science and Technology, McGraw Hill, New York NY, pp 
191-196, [1992]). Additional selectable genes have been described, for example, hygromycin 
resistance-conferring genes, trpB, which allows cells to utilize indole in place of tryptophan, 
or hisD, which allows cells to utilize histinol in place of histidine (Hartman and Mulligan, 
Proc. Natl. Acad. Sci., 85:8047 [1988]). Recently, the use of visible markers has gained 
popularity with such markers as anthocyanins, beta-glucuronidase and its substrate, GUS, and 
luciferase and its substrate, luciferin, being widely used not only to identify transformants, 
but also to quantify the amount of transient or stable protein expression attributable to a 
specific vector system (Rhodes et al., Meth. Mol. Biol, 55:121 [1995]). 

As used herein, the "sequence" of a gene (unless specifically stated 
otherwise), nucleic acid, protein, or peptide refers to the order of nucleotides in either or both 
strands of a double-stranded DNA molecule, e.g., the sequence of both the coding strand and 
its complement, or of a single-stranded nucleic acid molecule, or to the order of amino acids 
in a peptide or protein. 

As used herein, "specific binding" refers to the ability of one molecule, 
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typically an antibody or polynucleotide, to contact and associate with another specific 
molecule even in the presence of many other diverse molecules. For example, a single- 
stranded polynucleotide can specifically bind to a single-stranded polynucleotide that is 
complementary in sequence, and an antibody specifically binds to (or "is specifically 
5 immunoreactive with") its corresponding antigen. 

As used herein, "stringent hybridization conditions" or "stringency" refers 
to conditions in a range from about 5 °C to about 20°C or 25 °C below the melting 
temperature (T m ) of the target sequence and a probe with exact or nearly exact 

10 complementarity to the target. As used herein, the melting temperature is the temperature at 
which a population of double-stranded nucleic acid molecules becomes half-dissociaited into 
single strands. Methods for calculating the T m of nucleic acids are well known in the art (see, 
e.g., Berger and Kimmel (1987) Methods in Enzymology, Vol. 152: Guide to 
Molecular Cloning Techniques, San Diego: Academic Press, Inc. and Sambrook et al. 

15 (1989) Molecular Cloning: A Laboratory Manual, 2nd Ed., Vols. 1-3, Cold Spring 
Harbor Laboratory hereinafter, "Sambrook"), both incorporated herein by reference). As 
indicated by standard references, a simple estimate of the T m value may be calculated by the 
equation: T m - 81.5 + 0.4 1(% G + C), when a nucleic acid is in aqueous solution at 1 M 
NaCl {see e.g., Anderson and Young, Quantitative Filter Hybridization in Nucleic Acid 

20 Hybridization (1985)). Other references include more sophisticated computations which 
take structural as well as sequence characteristics into account for the calculation of T m . The 
melting temperature of a hybrid (and thus the conditions for stringent hybridization) is 
affected by various factors such as the length and nature (DNA, RNA, base composition) of 
the probe and nature of the target (DNA, RNA, base composition, present in solution or 

25 immobilized, and the like), and the concentration of salts and other components (e.g., the 

presence or absence of formamide, dextran sulfate, polyethylene glycol). The effects of these 
factors are well known and are discussed in standard references in the art, e.g., Sambrook, 
supra and Ausubel et al. supra. Typically, stringent hybridization conditions are salt 
concentrations less than about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion at 

30 pH 7.0 to 8.3, and temperatures at least about 30 °C for short probes (e.g., 10 to 50 

nucleotides) and at least about 60 °C for long probes (e.g., greater than 50 nucleotides). As 
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noted, stringent conditions may also be achieved with the addition of destabilizing agents 
such as formamide, in which case lower temperatures may be employed. 

As used herein, the term "substantial identity," "substantial sequence 
5 identity," or "substantial similarity" in the context of nucleic acids, refers to a measure of 
sequence similarity between two polynucleotides. Substantial sequence identity can be 
determined by hybridization under stringent conditions, by direct comparison, or other 
means. For example, two polynucleotides can be identified as having substantial sequence 
identity if they are capable of specifically hybridizing to each other under stringent 
1 0 hybridization conditions. Other degrees of sequence identity (e.g., less than "substantial") 

can be characterized by hybridization under different conditions of stringency. Alternatively, 
substantial sequence identity can be described as a percentage identity between two 
nucleotide (or polypeptide) sequences. Two sequences are considered substantially identical 
when they are at least about 60% identical, preferably at least about 70% identical, or at least 
1 5 about 80% identical, or at least about 90% identical, or at least about 95% or 98% to 100% 
identical. Percentage sequence (nucleotide or amino acid) identity is typically calculated by 
determining the optimal alignment between two sequences and comparing the two sequences. 
For example an exogenous transcript used for protein expression can be described as having a 
certain percentage of identity or similarity compared to a reference sequence (e.g., the 
20 corresponding endogenous sequence). Optimal alignment of sequences may be conducted 

using the local homology algorithm of Smith and Waterman (1981) Adv. Appl. Math. 2: 482, 
by the homology alignment algorithm of Needleman and Wunsch (1970) J. Mol. Biol. 48: 
443, by the search for similarity method of Pearson and Lipman (1988) Proc. Natl. Acad. Sci. 
U.S.A. 85: 2444, by computerized implementations of these algorithms (GAP, BESTFIT, 
25 FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer 

Group, 575 Science Dr., Madison, WI), or by inspection. The best alignment (i.e., resulting 
in the highest percentage of identity) generated by the various methods is selected. Typically 
these algorithms compare the two sequences over a "comparison window" (usually at least 18 
nucleotides in length) to identify and compare local regions of sequence similarity, thus 
30 allowing for small additions or deletions (i.e., gaps). Additions and deletions are typically 20 
percent or less of the length of the sequence relative to the reference sequence, which does 
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not comprise additions or deletions. It is sometimes desirable to describe sequence identity 
between two sequences in reference to a particular length or region (e.g., two sequences may 
be described as having at least 95% identity over a length of at least 500 basepairs). Usually 
the length will be at least about 50, 100, 200, 300, 400 or 500 basepairs, amino acids, or other 
5 residues. The percentage of sequence identity is calculated by comparing two optimally 
aligned sequences over the region of comparison, determining the number of positions at 
which the identical nucleic acid base (e.g., A, T, C, G, or U) occurs in both sequences to yield 
the number of matched positions, and determining the number (or percentage) of matched 
positions as compared to the total number of bases in the reference sequence or region of 

10 comparison. An additional algorithm that is suitable for determining sequence similarity is 
the BLAST algorithm, which is described in Altschul (1990) J. Mol. Biol 215: 403-410; and 
Shpaer (1996) Genomics 38:179-191. Software for performing BLAST analyses is publicly 
available at the National Center for Biotechnology Information 
(http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring 

1 5 sequence pairs (HSPs) by identifying short words of length W in the query sequence that 

either match or satisfy some positive- valued threshold score T when aligned with a word of 
the same length in a database sequence. T is referred to as the neighborhood word score 
threshold (Altschul et al 9 supra,). These initial neighborhood word hits act as seeds for 
initiating searches to find longer HSPs containing them. The word hits are extended in both 

20 directions along each sequence for as far as the cumulative alignment score can be increased. 
Extension of the word hits in each direction are halted when: the cumulative alignment score 
falls off by the quantity X from its maximum achieved value; the cumulative score goes to 
zero or below, due to the accumulation of one or more negative-scoring residue alignments; 
or the end of either sequence is reached. The BLAST algorithm parameters W, T and X 

25 determine the sensitivity and speed of the alignment. The BLAST program uses as defaults a 
wordlength (W) of 1 1, the BLOSUM62 scoring matrix (see Henikoff (1992) Proc. Natl 
Acad Scl USA 89: 10915-10919) alignments (B) of 50, expectation (E) of 10, M=5, N--4, 
and a comparison of both strands. The term BLAST refers to the BLAST algorithm which 
performs a statistical analysis of the similarity between two sequences; see, e.g., Karlin 

30 (1993) Proc. Natl Acad Scl USA 90:5873-5787. One measure of similarity provided by the 
BLAST algorithm is the smallest sum probability (P(N)), which provides an indication of the 
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probability by which a match between two nucleotide or amino acid sequences would occur 
by chance. For example, a nucleic acid can be considered similar to a TRT nucleic acid if the 
smallest sum probability in a comparison of the test nucleic acid to an TRT nucleic acid is 
less than about 0,5, 0.2, 0.1, 0.01, or 0.001. Alternatively, another indication that two nucleic 
5 acid sequences are similar is that the polypeptide which the first nucleic acid encodes is 

immunologically cross reactive with the polypeptide encoded by the second nucleic acid. It 
will be recognized that homologous non-human TRT polynucleotides may have less that 
"substantial" nucleotide identity in certain regions, as the term "substantial identity" is 
defined herein. For example, Euplotes TRT is substantially less than about 60% identical to 
10 the hTRT polynucleotide of Seq. ID. No. 1 in certain regions, although the two genes are 
homologs. 

As used herein, the terms "substantial identity," "substantial sequence 
identity, "or "substantial similarity" in the context of a polypeptide, refers to a degree of 

15 similarity between two polypeptides in which a polypeptides comprises a sequence with at 
least 70% sequence identity to a reference sequence, or 80%, or 85% or up to 100% sequence 
identity to the reference sequence, or most preferably 90% identity over a comparison 
window of about 10-20 amino acid residues. Amino acid sequence similarity, or sequence 
identity, is determined by optimizing residue matches, if necessary, by introducing gaps as 

20 required. See Needleham et aL (1970) J. Mol Biol 48: 443-453; and Sankoff et aL, 1983, 
Time Warps, String Edits, and Macromolecules, The Theory and Practice of Sequence 
Comparison, Chapter One, Addison- Wesley, Reading, MA; and software packages from 
IntelliGenetics, Mountain View, CA, and the University of Wisconsin Genetics Computer 
Group, Madison, WL As will be apparent to one of skill, the terms "substantial identity", 

25 "substantial similarity" and "substantial sequence identity" can be used interchangeably with 
regard to polypeptides or polynucleotides. It will be recognized that homologous non-human 
TRT polypeptides may have less that "substantial" sequence identity in certain regions, as the 
term "substantial identity" is defined herein. For example, Euplotes TRT protein is 
substantially less than about 60% identical to the hTRT polynucleotide of Seq. ID. No* 2 in 

30 certain regions, although the two genes are homologs. In the context of TRT polypeptides 
from different species, for example, "significant homology" at the amino acid sequence 
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means at least about 20% sequence identity in region of about 20 to about 40 residues, or at 
least about 40% sequence identity in region of at least about 20% sequence identity. 

As used herein, the term "substantially pure," or "substantially purified," 

5 when referring to a composition comprising a specified reagent, such as an antibody (e.g. an 
anti-hTRT antibody), means that the specified reagent is at least about 75%, or at least about 
90%, or at least about 95%, or at least about 99% or more of the composition (not including, 
e.g., solvent or buffer). Thus, for example, a preferred immunoglobulin preparation of the 
invention that specifically binds an hTRT polypeptide is substantially purified. 

10 

As used herein, a "telomerase negative" cell is one in which telomerase is not 
expressed, i.e., no telomerase catalytic activity can be detected using a conventional assay or 
a TRAP assay for telomerase catalytic activity. As used herein, a "telomerase positive" cell 
is a cell in which telomerase is expressed (i.e. telomerase activity can be detected). 

15 

As used herein, a "telomerase-related" disease or condition is a disease or 
condition in a subject that is correlated with an abnormally high level of telomerase activity 
in cells of the individual, which can include any telomerase activity at all for most normal 
somatic cells, or which is correlated with a low level of telomerase activity that results in 
20 impairment of a normal cell function. Examples of telomerase-related conditions include, 
e.g., cancer (high telomerase activity in malignant cells) and infertility (low telomerase 
activity in germ-line cells). 

As used herein, "test compound" or "agent" refers to any synthetic or natural 
25 compound or composition. The term includes all organic and inorganic compounds; 

including, for example, small molecules, peptides, proteins, sugars, nucleic acids, fatty acids 
and the like. 



XIII. EXAMPLES 

30 The following examples are provided to illustrate the present invention, and 

not by way of limitation. 
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In the following sections, the following abbreviations apply: eq (equivalents); 
M (Molar); \xM (micromolar); N (Normal); mol (moles); mmol (millimoles); jimol 
(micromoles); nmol (nanomoles); g (grams); mg (milligrams); fig (micrograms); ng 
(nanograms); 1 or L (liters); ml (milliliters); juil (microliters); cm (centimeters); mm 
5 (millimeters); Jim (micrometers); nm (nanometers); °C (degrees Centigrade); RPN 

(ribonucleoprotein); mreN (2-0-methylribonucleotides); dNTP (deoxyribonucleotide); dH 2 0 
(distilled water); DDT (dithiothreitol); PMSF (phenylmethylsulfonyl fluoride); TE (10 mM 
Tris HO, 1 mM EDTA, approximately pH 7.2); KGlu (potassium glutamate); SSC (salt and 
sodium citrate buffer); SDS (sodium dodecyl sulfate); PAGE (polyacrylamide gel 

10 electrophoresis); Novex (Novex, San Diego, CA); BioRad (Bio-Rad Laboratories, Hercules, 
CA); Pharmacia (Pharmacia Biotech, Piscataway, NT); Boehringer-Mannheim (Boehringer- 
Mannheim Corp., Concord, CA); Amersham (Amersham, Inc., Chicago, IL); Stratagene 
(Stratagene Cloning Systems, La Jolla, CA); NEB (New England Biolabs, Beverly, MA); 
Pierce (Pierce Chemical Co., Rockford, IL); Beckman (Beckman Instruments, Fullerton, 

15 CA); Lab Industries (Lab Industries, Inc., Berkeley, CA); Eppendorf (Eppendorf Scientific. 
Madison, WI); and Molecular Dynamics (Molecular Dynamics, Sunnyvale, CA). 

EXAMPLE 1 

ISOLATION OF TELOMERASE PROTEINS AND CLONES 

20 The following example details the isolation of telomerase proteins and clones 

from various organisms, including the euplotes p. 123, hTRT, TRT and S. pombe TRT 
telomerase cDNA clones. 
A. Background 

i) Introduction 

25 This section provides an overview of the purification and cloning of TRT 

genes, which is described in greater detail in subsequent sections of this Example. While 
telomerase RNA subunits have been identified in ciliates, yeast and mammals, protein 
subunits of the enzyme have not been identified as such prior to the present invention. 
Purification of telomerase from the ciliated protozoan Euplotes aediculatus yielded two 

30 proteins, termed pl23 and p43 (see infra; Lingner (1996) Proa Natl Acad. Scl U.S.A. 

93:10712). Euplotes aediculatus is a hypotrichous ciliate having a macronucleus containing 
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about 8 x 10 7 telomeres and about 3 x 10 5 molecules of telomerase. After purification, the 
active telomerase complex had a molecular mass of about 230 kD, corresponding to a 66 kD 
RNA subunit and two proteins of about 123 kD and 43 kD (Lingner (1996) supra). 
Photocross-linking experiments indicated that the larger pi 23 protein was involved in 
5 specific binding of the telomeric DNA substrate (Lingner, (1996) supra). 

The pi 23 and p43 proteins were sequenced and the cDNA clones which 
encoded these proteins were isolated. These Euplotes sequences were found to be unrelated to 
the Tetrahymena telomerase-associated proteins p80 and p95. Sequence analysis of the 
Euplotes pi 23 revealed reverse transcriptase (RT) motifs. Furthermore, sequence analysis of 

10 the Euplotes pi 23 by comparison to other sequences revealed a yeast homolog, termed Est2 
protein (Lingner (1997) Science 276:561). Yeast Est2 had previously been shown to be 
essential for telomere maintenance in vivo (Lendvay (1996) Genetics 144:1399) but had not 
been identified as a telomerase catalytic protein. Site-specific mutagenesis demonstrated that 
the RT motifs of yeast Est2 are essential for telomeric DNA synthesis in vivo and in vitro 

15 (Lingner (1997) supra). 

ii) Identifying and Characterizing S. pombe Telomerase 
PCR amplification of S. pombe DNA was carried out with degenerate 
sequence primers designed from the Euplotes pi 23 RT motifs as described below. Of the 
four prominent PCR products generated, a 120 base pair band encoded a peptide sequence 

20 homologous to pi 23 and Est2. This PCR product was used as a probe in colony 

hybridization and identified two overlapping clones from an S. pombe genomic library and 
three from an S. pombe cDNA library. Sequence analysis revealed that none of the three S. 
pombe cDNA clones was full length, so RT-PCR was used to obtain the sequences encoding 
the proteiris N-terminus. 

25 Complete sequencing of these clones revealed a putative S. pombe telomerase 

RT gene, trtl. The complete nucleotide sequence of trt\ has been deposited in GenBank, 

accession number AF01 5783 (see Figure 15). 

To test S. pombe trtl (as a catalytic subunit, two deletion constructs were 

created. Analysis of the sequence showed that trtl encoded a basic protein with a predicted 
30 molecular mass of 1 1 6 kD. It was found that homology with pi 23 and Est2 was especially 

high in the seven reverse transcriptase motifs, underlined and designated as motifs 1, 2, A, B, 
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C, D, and E (see Figure 63). An additional telomerase-specific motif, designated the T- 
motif, was also found. Fifteen introns, ranging in size from 36 to 71 base pairs, interrupted 
the coding sequence. 

To test S. pombe trt\ as a catalytic subunit, two deletion constructs were 
5 created. One removed only motifs B through D in the RT domains. The second removed 
99% of the open reading frame. 

Haploid cells grown from & pombe spores of both mutants showed 
progressive telomere shortening to the point where hybridization to telomeric repeats became 
almost undetectable. A trtY/ trtY diploid was sporulated and the resulting tetrads were 

10 dissected and germinated on a yeast extract medium supplemented with amino acids (a YES 
plate, Alfa (1993) Experiments with Fission Yeasty Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, NY). Colonies derived from each spore were grown at 32°C for three 
days, and streaked successively to fresh YES plates every three days. A colony from each 
round was placed in six ml of YES liquid culture at 32°C and grown to stationary phase. 

15 Genomic DNA was prepared. After digestion with Apal, DNA was subjected to 
electrophoresis on a 2.3% agarose gel, stained with ethidium bromide to confirm 
approximately equal loading in each lane, then transferred to a nylon membrane and 
hybridized to a telomeric DNA probe. 

Senescence was indicated by the delayed onset of growth or failure to grow on 

20 agar (typically at the fourth streak-out after germination) and by colonies with increasingly 

ragged edges (colony morphology shown in Figure 22C) and by increasingly high fractions of 
elongated cells (as shown in Figure 22D). Cells were plated on Minimal Medium (Alfa 
(1993) supra) with glutamic acid substituted for ammonium chloride for two days at 32°C 
prior to photography. 

25 When individual enlarged cells were separated on the dissecting microscope, 

the majority were found to undergo no further division. The same telomerase negative (trtV) 
cell population always contained normal-sized cells which continued to divide, but which 
frequently produced non-dividing progeny. The telomerase-negative survivors may use a 
recombinational mode of telomere maintenance as documented in budding yeast strains that 

30 have various telomere-replication genes deleted (Lendvay (1996) supra, Lundblad (1993) 
Cell 73:347). 
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iii) Identifying and Characterizing Human Telomerase 

An EST (expressed sequence tag) derived from human telomerase reverse 
transcriptase (hTRT) cDNA was identified by a BLAST search of the dbEST (expressed 
sequence tag) Genbank database using the Euplotes 123 kDa peptide and nucleic acid 

5 sequences, as well as the Schizosaccharomyces protein and corresponding cDN A (tez 1 ) 
sequences. The EST, designated Genbank AA28196, is 389 nucleotides long and it 
corresponds to positions 1679 to 2076 of clone 712562 (Figure 18), was obtained from the 
I.M.A.G.E. Consortium (Human Genome Center, DOE, Lawrence Livermore National 
Laboratory, Livermore, CA). This clone was obtained from a cDNA library of germinal B 

1 0 cells derived by flow sorting of tonsil cells. Complete sequencing of this hTRT cDN A clone 
showed all eight telomerase RT (TRT) motifs. However, this hTRT clone did not encode a 
contiguous portion of a TRT because RT motifs B', C, D, and E, were contained in a different 
open reading frame than the more N-terminal RT motifs. In addition, the distance between 
RT motifs A and B was substantially shorter than that of the three previously known (non- 

15 human) TRTs. 

To isolate a full length cDNA clone, a cDNA library derived form the human 
293 cell line (described above) which expresses high levels of telomerase activity, was 
screened. A lambda cDNA library from the 293 cell line was partitioned into 25 pools 
containing about 200,000 plaques each. Each pool was screened by PCR with the primer pair 

20 5--CGGAAGAGTGTCTGGAGCAA-3' and 5'-GGATGAAGCGGAGTCTGGA-3'. Six 

subpools of one positive primary pool were further screened by PCR using this same primer 
pair. For both the primary and the secondary subpool screening, hTRT was amplified for a 
total of 31 cycles at: 94°C, 45 seconds; 60°C, 45 seconds; and 72°C, 90 seconds. As a 
control, RNA of the house-keeping enzyme GAPDH was amplified using the primer pair 5'- 

25 CTCAGACACCA 

TGGGGAAGGTGA-3' and 5'- ATGATCTTG AGGCTGTTGTC AT A-3 ' for a total of 16 
cycles at 94°C, 45 seconds; 55°C, 45 seconds; and 72°C, 90 seconds. 

One hTRT positive subpool from the secondary screening was then screened 
by plaque hybridization with a probe from the 5' region of clone #712562. One phage was 

30 positively identified (designated Lambda phage 25-1 .1, ATCC 209024, deposited May 12, 
1997). It contained an approximately four kilobase insert, which was excised and subcloned 
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into the EcoRI site of pBluescript II SK+ vector (Stratagene, San Diego, CA) as an EcoRI 
fragment. This cDNA clone-containing plasmid was designated pGRN121. ThecDNA 
insert totals approximately 4 kilobasepairs. The complete nucleotide sequence of the human 
hTRT cDNA (pGRN121) has been deposited in Genbank (accession AFO 15950) and the 
5 plasmid has been deposited with the ATCC (ATCC 209016, deposited May 6, 1997).. 

B. Growth of Euplotes aediculatus 

In this Example, cultures of E. aediculatus were obtained from Dr. David 
Prescott, MCDB, University of Colorado. Dr. Prescott originally isolated this culture from 
10 pond water, although this organism is also available from the ATCC (ATCC #30859). 

Cultures were grown as described by S wanton et al, (S wanton et al, Chromosoma 77:203 
[1980]), under non-sterile conditions, in 15-liter glass containers containing Chlorogonium as 
a food source. Organisms were harvested from the cultures when the density reached 
approximately 10 4 cells/ml 

15 

C. Preparation of Nuclear Extracts 

In this Example, nuclear extracts of E. aediculatus were prepared using the 
method of Lingner et al t (Lingner et al, Genes Develop., 8:1984 [1994]), with minor 
modifications, as indicated below. Briefly, cells grown as described in Part B were 

20 concentrated with 1 5 ^im Nytex filters and cooled on ice. The cell pellet was resuspended in 
a final volume of 1 10 ml TMS/PMSF/spermidine phosphate buffer. The stock 
TMS/PMSF/spermidine phosphate buffer was prepared by adding 0.075 g spermidine 
phosphate (USB) and 0.75 ml PMSF (from 100 mM stock prepared in ethanol) to 150 ml 
TMS. TMS comprised 10 mM Tris-acetate, 10 mM MgCl 2 , 85.5752 g sucrose/liter, and 

25 0.33297 g CaCl 2 /liter, pH 7.5. 

After resuspension in TMS/PMSF/spermidine phosphate buffer, 8.8 ml 10% 
NP-40 and 94.1 g sucrose were added and the mixture placed in a siliconized glass beaker 
with a stainless steel stirring rod attached to an overhead motor. The mixture was stirred until 
the cells were completely lysed (approximately 20 minutes). The mixture was then 

30 centrifuged for 10 minutes at 7500 rpm (8950 x g), at 4°C, using a Beckman JS-13 swing-out 
rotor. The supernatant was removed and nuclei pellet was resuspended in 
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TMS/PMSF/spermidine phosphate buffer, and centrifuged again, for 5 minutes at 7500 rpm 
(8950 x g), at 4°C, using a Beckman JS-13 swing-out rotor. 

The supernatant was removed and the nuclei pellet was resuspended in a 
buffer comprised of 50 mM Tris-acetate, 10 mM MgCl 2 , 10% glycerol, 0.1% NP-40, 0.4 M 
KGlu, 0.5 mM PMSF, pH 7.5, at a volume of 0.5 ml buffer per 10 g of harvested cells. The 
resuspended nuclei were then dounced in a glass homogenizer with approximately 50 strokes, 
and then centrifuged for 25 minutes at 14,000 rpm at 4°C, in an Eppendorf centrifuge. The 
supernatant containing the nuclear extract was collected, frozen in liquid nitrogen, and stored 
at -80 °C until used 

D. Purification of Telomerase 

In this Example, nuclear extracts prepared as described in Part C were used to purify 

E. aediculatus telomerase. In this purification protocol, telomerase was first enriched by 
chromatography on an Affi-Gel-heparin column, and then extensively purified by affinity 
purification with an antisense oligonucleotide. As the template region of telomerase RNA is 
accessible to hybridization in the telomerase RNP particle, an antisense oligonucleotide (i.e., 
the "affinity oligonucleotide") was synthesized that was complementary to this template 
region as an affinity bait for the telomerase. A biotin residue was included at the 5' end of the 
oligonucleotide to immobilize it to an avidin column. 

Following the binding of the telomerase to the oligonucleotide, and extensive 
washing, the telomerase was eluted by use of a displacement oligonucleotide. The affinity 
oligonucleotide included DNA bases that were not complementary to the telomerase RNA 5 1 
to the telomerase-specific sequence. As the displacement oligonucleotide was 
complementary to the affinity oligonucleotide for its entire length, it was able to form a more 
thermodynamically stable duplex than the telomerase bound to the affinity oligonucleotide. 
Thus, addition of the displacement oligonucleotide resulted in the elution of the telomerase 
from the column. 

The nuclear extracts prepared from 45 liter cultures were frozen until a total of 
34 ml of nuclear extract was collected. This corresponded to 630 liters of culture (i.e., 
approximately 4 x 10 9 cells). The nuclear extract was diluted with a buffer to 410 ml, to 
provide final concentrations of 20 mM Tris-acetate, 1 mM MgCl 2 , 0.1 mM EDTA, 33 mM 
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KGlu, 10% (vol/vol) glycerol, 1 mM dithiothreitol (DTT), and 0.5 mM phenylmethylsulfonyl 
fluoride (PMSF), at a pH of 7.5. 

The diluted nuclear extract was applied to an Affi-Gel-heparin gel column 
(Bio-Rad), with a 230 ml bed volume and 5 cm diameter, equilibrated in the same buffer and 
eluted with a 2-liter gradient from 33 to 450 mM KGlu. The column was run at 4°C ;( at a 
flow rate of 1 column volume/hour. Fractions of 50 mis each were collected and assayed for 
telomerase activity as described in Part E. Telomerase was eluted from the column at 
approximately 170 mM KGlu. Fractions containing telomerase (approximately 440 ml) were 
pooled and adjusted to 20 mM Tris-acetate, 10 mM MgCl 2 , 1 mM EDTA, 300 mM KGlu, 
10% glycerol, 1 mM DTT, and 1% Nonidet P-40. This buffer was designated as "WB." 

To this preparation, 1.5 nmol of each of two competitor DNA oligonucleotides 
(5 f -TAGACCTGTTAGTGTACATTTGAATTGAAGC-3 f (and (5'- 
TAGACCTGTTAGGTTGGATTTGTGGCATCA-3', 50 jig yeast RNA (Sigma), and 0.3 
nmol of biotin-labeled telomerase-specific oligonucleotide (5 ! -biotin-T AG ACCTGTT A- 
(mreG) 2 -(rmeU)4-(rmeG)4-(rmeU)4-remG-3 , ), were added per ml of the pool. The 2-0- 
methyribonucleotides of the telomerase specific oligonucleotides were complementary to the 
the telomerase RNA; template region; the deoxyribonucleotides were not complementary. 
The inclusion of competitor, non-specific DNA oligonucleotides increased the efficiency of 
the purification, as the effects of nucleic acid binding proteins and other components in the 
mixture that would either bind to the affinity oligonucleotide or remove the telomerase from 
the mixture were minimized. 

This material was then added to Ultralink immobilized neutravidin plus 
(Pierce) column material, at a volume of 60 \il of suspension per ml of pool. The column 
material was pre-blocked twice for 1 5 minutes each blocking, with a preparation of WB 
containing 0.01% Nonidet P-40, 0.5 mg BSA, 0.5 mg/ml lysozyme, 0.05 mg/ml glycogen, 
and 0.1 mg/ml yeast RNA. The blocking was conducted at 4°C, using a rotating wheel to 
block the column material thoroughly. After the first blocking step, and before the second 
blocking step, the column material was centrifiiged at 200 x g for 2 minutes to pellet the 
matrix. 

The pool-column mixture was incubated for 8 minutes at 30 °C, and then for 
an additional 2 hours at 4° C, on a rotating wheel (approximately 10 rpm; Labindustries) to 

199 



allow binding. The pool-column mixture was then centrifuged 200 xg for 2 minutes, and the 
supernatant containing unbound material was removed. The pool-column mixture was then 
washed. This washing process included the steps of rinsing the pool-column mixture with 
WB at 4°C, washing the mixture for 15 minutes with WB at 4°C, rinsing with WB 9 washing 

5 for 5 minutes at 30°C, with WB containing 0.6 M KGlu, and no Nonidet P-40, washing 5 
minutes at 25 °C with WB, and finally, rinsing again with WB. The volume remaining after 
the final wash was kept small, in order to yield a ratio of buffer to column material of 
approximately 1:1. 

Telomerase was eluted from the column material by adding 1 nmol of 

1 0 displacement deoxyoligonucleotide (5 , -CA 4 C 4 A 4 C 2 TA 2 CAG 2 TCTA-3 l ), per ml of column 

material and incubating at 25 °C for 30 minutes. The material was centrifuged for 2 minutes 
at 14,000 rpm in a microcentrifuge (Eppendorf), and the eluate collected. The elution 
procedure was repeated twice more, using fresh displacement oligonucleotide each time. As 
mentioned above, because the displacement oligonucleotide was complementary to the 

1 5 affinity oligonucleotide, it formed a more thermodynamically stable complex with the affinity 
oligonucleotide than P-40. Thus, addition of the displacement oligonucleotide to an affinity- 
bound telomerase resulted in efficient elution of telomerase under native conditions. The 
telomerase appeared to be approximately 50% pure at this stage, as judged by analysis on a 
protein gel. The affinity purification of telomerase and elution with a displacement 

20 oligonucleotide is shown in Figure 26 (panels A and B, respectively). In this Figure, the 2- 
O-methyl sugars of the affinity oligonucleotide are indicated by the bold line. The black and 
shaded oval shapes in this Figure are intended to represent graphically the protein subunits of 
the present invention. 

The protein concentrations of the extract and material obtained following Affi- 

25 Gel-heparin column chromatography were determined using the method of Bradford 

(Bradford, Anal. Biochem., 72:248 [1976]), using BSA as the standard. Only a fraction of 
the telomerase preparation was further purified on a glycerol gradient. 

The sedimentation coefficient of telomerase was determined by glycerol 
gradient centrifugation, as described in Part I. 

30 Table 5 below is a purification table for telomerase purified according to the 

methods of this Example. The telomerase was enriched 12-fold in nuclear extracts, as 
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compared to whole cell extracts, with a recovery of 80%; 85% of telomerase was solubilized 
from nuclei upon extraction. 



TableS. Purification of Telomerase 
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Heparin 
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10 




Affinity 


0.3** 


680 


2270 


40 


2670 


10 


Glycerol 
Gradient 


NA* 


NA* 


NA* 


25 


NA* 



*NA=Not available 

**This value was calculated from the measured amount of telomerase (680 pmol), by 
assuming a purity of 50% (based on a protein gel). 



15 

E. Telomerase Activity 

At each step in the purification of telomerase, the preparation was analyzed by 
three separate assays, one of which was activity, as described in this Example. In general, 
telomerase assays were done in 40 |il containing 0.003-0.3 |il of nuclear extract, 50 mM Tris- 

20 CI (pH 7.5), 50 mM KGlu, 10 mM MgCl 2 , 1 mM DTT, 125 ^iM dTTP, 125 ^M dGTP, and 
approximately 0.2 pmoles of 5 f - 32 P-labelled oligonucleotide substrate (i.e., approximately 
400,000 cpm). Oligonucleotide primers were heat-denatured prior to their addition to the 
reaction mixture. Reactions were assembled on ice and incubated for 30 minutes at 25 °C. 
The reactions were stopped by addition of 200 ^1 of 10 mM Tris-Cl (pH 7.5), 15 mM EDTA, 

25 0.6% SDS, and 0.05 mg/ml proteinase K, and incubated for at least 30 minutes at 45 °C. 
After ethanol precipitation, the products were analyzed on denaturing 8% PAGE gels, as 
known in the art (See e.g., Sambrook et ai, 1989). 
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F. Quantitation of Telomerase Activity 

In this Example, quantitation of telomerase activity through the purification 
procedure is described. Quantitation was accomplished by assaying the elongation of 
oligonucleotide primers in the presence of dGTP and [a - 32 P]dTTP. Briefly, 1 |iM 5HG 4 T 4 ) 2 - 
5 y oligonucleotide was extended in a 20 \il reaction mixture in the presence of 2 \il of [a - 
32 P]dTTP (10 mCi/ml, 400 Ci/mmol; 1 Ci=37 GBq), and 125 |iM dGTP as described 
(Lingner et al, Genes Develop., 8:1984 [1994]) and loaded onto an 8% PAGE sequencing gel 
as described. 

The results of this study are shown in Figure 28. In lane 1, there is no 
10 telomerase present (i.e., a negative control); lanes 2, 5, 8, and 1 1 contained 0.14 finol 

telomerase; lanes 3, 6, 9, and 12 contained 0.42 finol telomerase; and lanes 4, 7, 10, and 13 
contained 1.3 fmol telomerase. Activity was quantitation using a Phosphorlmager (Molecular 
Dynamics) using the manufacturer's instructions. It was determined that under these 
conditions, 1 fmol of affinity-purified telomerase incorporated 21 fmol of dTTP in 30 
1 5 minutes. 

As shown in Figure 28, the specific activity of the telomerase did not change 
significantly through the purification procedure. Affinity-purified telomerase was fully 
active. However, it was determined that at high concentrations, an inhibitory activity was 
detected and the activity of crude extracts was not linear. Thus, in the assay shown in Figure 
20 28, the crude extract was diluted 700-7000-fold. Upon purification, this inhibitory activity 
was removed and no inhibitory effect was detected in the purified telomerase preparations, 
even at high enzyme concentrations. 

G. Gel Electrophoresis and Northern Blots 

25 As stated in Part E, at each step in the purification of telomerase, the 

preparation was analyzed by three separate assays. This Example describes the gel 
electrophoresis and blotting procedures used to quantify telomerase RNA present in fractions 
and analyze the integrity of the telomerase ribonucleoprotein particle, 
i) Denaturing Gels and Northern Blots 

30 In this Example, synthetic T7-transcribed telomerase RNA of known 

concentration served as the standard. Throughout this investigation, the RNA component 
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was used as a measure of telomerase. 

A construct for phage T7 RNA polymerase transcription of E. aediculatus 
telomerase RNA was produced, using (PCR). The telomerase RNA gene was amplified with 
primers that annealed to either end of the gene. The primer that annealed at the 5 1 end also 
5 encoded a hammerhead ribozyme sequence to generate the natural 5 f end upon cleavage of 
the transcribed RNA, a T7-promoter sequence, and an £coRI site for subcloning. The 
sequence of this 5' primer was 5-GCGGGAATTCTAA 

TACGACTCACTATAGGGAAGAAACTCTGATGAGGCCGAAAGGCCGAAACTCCAC 
GAAAGTGGAGTAAGTTTCTCGATAATTGATCTGTAG-3\ The 3* primer included an 

10 Earl site for termination of transcription at the natural 3' end, and a BamUl site for cloning. 
The sequence of this 3' primer was S'-CGGGGATCCTCTTCAAAAG 
ATGAGAGGACAGCAAAC-3'. The PCR amplification product was cleaved with EcoRl 
and Bamm, and subcloned into the respective sites of pUC19 (NEB), to give "pEaT7." The 
correctness of this insert was confirmed by DNA sequencing. T7 transcription was 

15 performed as described by Zaug et al, Biochemistry 33:14935 [1994], with £arl-linearized 
plasmid. RNA was gel-purified and the concentration was determined (an A 260 of 1 = 40 
|ng/ml). This RNA was used as a standard to determine the telomerase RNA present in 
various preparations of telomerase. 

The signal of hybridization was proportional to the amount of telomerase 

20 RNA, and the derived RNA concentrations were consistent with, but slightly higher than 

those obtained by native gel electrophoresis. Comparison of the amount of whole telomerase 
RNA in whole cell RNA to serial dilutions of known T7 RNA transcript concentrations 
indicated that each E. aediculatus cell contained approximately 300,000 telomerase 
molecules. 

25 Visualization of the telomerase was accomplished by Northern blot 

hybridization to its RNA component, using methods as described (Linger et al, Genes 
Develop., 8:1984 [1994]). Briefly, RNA (less than or equal to 0.5 |ig/lane) was resolved on 
an 8% PAGE and electroblotted onto a Hybond-N membrane (Amersham), as known in the 
art (see e.g., Sambrook et al, 1989). The blot was hybridized overnight in 10 ml of 4x SSC, 

30 lOx Denhardt's solution, 0.1% SDS, and 50 jig/ml denatured herring sperm DNA. After pre- 
hybridizing for 3 hours, 2 x 10 6 cpm probe/ml hybridization solution was added. The 
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randomly labelled probe was a PCR-product that covered the entire telomerase RNA gene. 
The blot was washed with several buffer changes for 30 minutes in 2x SSC, 0.1% SDS, and 
then washed for 1 hour in O.lx SSC and 0.1% SDS at 45°C. 

ii) Native Gels and Northern Blots 

In this experiment, the purified telomerase preparation was run on native (i.e., 
non-denaturing) gels of 3.5% polyacrylamide and 0.33% agarose, as known in the art and 
described (Lamond and Sproat, [1994], supra). The telomerase comigrated approximately 
with the xylene cyanol dye. 

The native gel results indicated that telomerase was maintained as an RNP 
throughout the purification protocol. Figure 27 is a photograph of a Northern blot showing 
the mobility of the telomerase in different fractions on a non-denaturing gel as well as in vitro 
transcribed telomerase. In this figure, lane 1 contained 1.5 finol telomerase RNA, lane 2 
contained 4.6 fmol telomerase RNA, lane 3 contained 14 fmol telomerase RNA, lane 4 
contained 41 fmol telomerase RNA, lane 5 contained nuclear extract (42 fmol telomerase), 
lane 6 contained Affi-Gel-heparin-purified telomerase (47 fmol telomerase), lane 7 contained 
affinity-purified telomerase (68 fmol), and lane 8 contained glycerol gradient-purified 
telomerase (35 finol). 

As shown in Figure 27, in nuclear extracts, the telomerase was assembled into 
an RNP particle that migrated slower than unassembled telomerase RNA. Less than 1% free 
RNA was detected by this method. However, a slower migrating telomerase RNP complex 
was also sometimes detected in extracts. Upon purification on the Affi-Gel-heparin column, 
the telomerase RNP particle did not change in mobility (Figure 27, lane 6). However, upon 
affinity purification the mobility of the RNA particle slightly increased (Figure 27, lane 7), 
perhaps indicating that a protein subunit or fragment had been lost. On glycerol gradients, 
the affinity-purified telomerase did not change in size, but approximately 2% free telomerase 
RNA was detectable (Figure 27, lane 8), suggesting that a small amount of disassembly of the 
RNP particle had occurred. 

H. Telomerase Protein Composition 

In this Example, the analysis of the purified telomerase protein composition 
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are described. 

Glycerol gradient fractions obtained as described in Part D, were separated on 
a 4-20% polyacrylamide gel (Novex). Following electrophoresis, the gel was stained with 
Coomassie brilliant blue. Figure 29 shows a photograph of the geL Lanes 1 and 2 contained 
molecular mass markers (Pharmacia) as indicated on the left side of the gel shown in Figure 
29. Lanes 3-5 contained glycerol gradient fraction pools as indicated on the top of the gel 
(i.e., lane 3 contained fractions 9-14, lane 4 contained fractions 15-22, and lane 5 contained 
fractions 23-32). Lane 4 contained the pool with 1 pmol of telomerase RNA. In lanes 6-9 
BSA standards were run at concentrations indicated at the top of the gel in Figure 29 (f.e., 
lane 6 contained 0.5 pmol BSA, lane 7 contained 1.5 pmol BSA, lane 8 contained 4.5 BSA, 
and lane 9 contained 15 pmol BSA). 

As shown in Figure 29, polypeptides with molecular masses of 120 and 43 
kDa co-purified with the telomerase. The 43 kDa polypeptide was observed as a doublet. It 
was noted that the polypeptide of approximately 43 kDa in lane 3 migrated differently than 
the doublet in lane 4; it may be an unrelated protein. The 120 kDa and 43 kDa doublet each 
stained with Coomassie brilliant blue at approximately the level of 1 pmol, when compared 
with BSA standards. Because this fraction contained 1 pmol of telomerase RNA, all of which 
was assembled into an RNP particle (See, Figure 27, lane 8), there appear to be two 
polypeptide subunits that are stoichiometric with the telomerase RNA. However, it is also 
possible that the two proteins around 43 kDa are separate enzyme subunits. 

Affinity-purified telomerase that was not subjected to fractionation on a 
glycerol gradient contained additional polypeptides with apparent molecular masses of 35 and 
37 kDa, respectively. This latter fraction was estimated to be at least 50% pure. However, 
the 35 kDa and 37 kDa polypeptides that were present in the affinity-purified material were 
not reproducibly separated by glycerol gradient centrifugation. These polypeptides may be 
contaminants, as they were not visible in all activity-containing preparations. 

I. Sedimentation Coefficient 

The sedimentation coefficient for telomerase was determined by glycerol 
gradient centrifugation. In this Example, nuclear extract and affinity-purified telomerase 
were fractionated on 15-40% glycerol gradients containing 20 mM Tris-acetate, with 1 mM 
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MgCl 2 , 0.1 mM EDTA, 300 mM KGlu, and 1 mM DTT, at pH 7.5. Glycerol gradients were 
poured in 5 ml (13 x 51 mm) tubes, and centrifuged using an SW55Ti rotor (Beckman) at 
55,000 rpm for 14 hours at 4°C. 

Marker proteins were run in a parallel gradient and had a sedimentation 
coefficient of 7.6 S for alcohol dehydrogenase (ADH), 1 13 S for catalase, 17.3 S for 
apoferritin, and 19.3 S for thyroglobulin. The telomerase peak was identified by native gel 
electrophoresis of gradient fractions followed by blot hybridization to its RNA component. 

Figure 30 is a graph showing the sedimentation coefficient for telomerase. As 
shown in this Figure, affinity-purified telomerase co-sedimented with catalase at 1 1 .5 S, 
while telomerase in nuclear extracts sedimented slightly faster, peaking around 12.5 S. 
Therefore, consistent with the mobility of the enzyme in native gels, purified telomerase 
appears to have lost a proteolytic fragment or a loosely associated subunit. 

The calculated molecular mass for telomerase, if it is assumed to consist of 
one 120 kDa protein subunit, one 43 kDa subunit, and one RNA subunit of 66 kDa, adds up 
to a total of 229 kDa. This is in close agreement with the 232 kDa molecular mass of 
catalase. However, the sedimentation coefficient is a function of the molecular mass, as well 
as the partial specific volume and the frictional coefficient of the molecule, both of which are 
unknown for the Euplotes telomerase RNP. 

J. Substrate Utilization 

In this Example, the substrate requirements of Euplotes telomerase were 
investigated. One simple model for DNA end replication predicts that after semi- 
conservative DNA replication, telomerase extends double-stranded, blunt-ended DNA 
molecules. In a variation of this model, a single-stranded 3' end is created by a helicase or 
nuclease after replication. This 3' end is then used by telomerase for binding and extension. 

To determine whether telomerase is capable of elongating blunt-ended 
molecules, model hairpins were synthesized with telomeric repeats positioned at their 3 1 ends. 
These primer substrates were gel-purified, 5 f -end labelled with polynucleotide kinase, heated 
at 0.4 \xM to 80°C for 5 minutes, and then slowly cooled to room temperature in a heating 
block, to allow renaturation and helix formation of the hairpins. Substrate mobility on a non- 
denaturing gel indicated that very efficient hairpin formation was present, as compared to 
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dimerization. 

Assays were performed with unlabelled 125 |iM dGTP, 125 dTTP, and 
0.02 |iM 5'-end-labelled primer (5'- 32 P-labelled oligonucleotide substrate) in 10 jil reaction 
mixtures that contained 20 mM Tris-acetate, with 10 mM MgCl 2 , 50 mM KGlu, and 1 mM 
5 DTT, at pH 7.5. These mixtures were incubated at 25 °C for 30 minutes. Reactions were 
stopped by adding formamide loading buffer (i.e., TBE, formamide, bromthymol blue, and 
cyanol, Sambrook, 1989, supra). 

Primers were incubated without telomerase ("-"), with 5.9 fmol of affinity- 
purified telomerase ("+"), or with 17.6 finol of affinity-purified telomerase ("+++"). Affinity- 
1 0 purified telomerase used in this assay was dialyzed with a membrane having a molecular cut- 
off of 100 kDa, in order to remove the displacement oligonucleotide. Reaction products were 
separated on an 8% PAGE/urea gel containing 36% formamide, to denature the hairpins. The 
sequences of the primers used in this study, as well as their lane assignments are shown in 
Table 6. 

15 

TABLE 6. Primer Sequences 



Lane 


Primer Sequence (5 1 to 3') 


1-3 


C 4 (A 4 C 4 ) 3 CACA(G 4 T 4 ) 3 G 4 


4-6 


C 2 (A 4 C 4 ) 3 CACA(G 4 T 4 ) 3 G 4 


7-9 


(A 4 C 4 ) 3 CACA(G 4 T 4 ) 3 G 4 


10-12 


A 2 C 4 (A 4 C 4 ) 2 CACA(G 4 T 4 ) 3 G 4 


13-15 


C 4 (A 4 C 4 ) 2 CACA(G 4 T 4 ) 3 


16-18 


(AA^CACACG/T^ 


19-21 


A 2 C 4 (A 4 C 4 ) 2 CACA(G 4 T 4 ) 3 


22-24 


C 4 (A 4 C 4 ) 2 CACA(G 4 T 4 ) 3 


25-27 


C 2 (A 4 C 4 ) 2 CACA(G 4 T 4 ) 3 


28-30 


(A4C 4 ) 2 CACA(G 4 T 4 ) 3 



The gel results are shown in Figure 31. Lanes 1-15 contained substrates with 
30 telomeric repeats ending with four G residues. Lanes 16-30 contained substrates with 

telomeric repeats ending with four T residues. The putative alignment on the telomerase 
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RNA template is indicated in Figure 32. It was assumed that the primer sets anneal at two 
very different positions in the template shown in Figure 32 (ie., Panel A and Panel B, 
respectively). This may have affected their binding and/or elongation rate. 

Figure 33 shows a lighter exposure of lanes 25-30 in Figure 31. The lighter 
exposure of Figure 33 was taken to permit visualization of the nucleotides that are added and 
the positions of pausing in elongated products. Percent of substrate elongated for the third 
lane in each set was quantified on a Phosphorlmager, as indicated on the bottom of Figure 3 1 . 

The substrate efficiencies for these hairpins were compared with double- 
stranded telomere-like substrates with overhangs of differing lengths. A model substrate that 
ended with four G residues (see lanes 1-15 of Figure 31) was not elongated when it was blunt 
ended (see lanes 1-3). However, slight extension was observed with an overhang length of 
two bases; elongation became efficient when the overhang was at least 4 bases in length. The 
telomerase acted in a similar manner with a double-stranded substrate that ended with four T 
residues, with a 6-base overhang required for highly efficient elongation. In Figure 31, the 
faint bands below the primers in lanes 10-15 that are independent of telomerase represent 
shorter oligonucleotides in the primer preparations. 

The lighter exposure of lanes 25-30 in Figure 33 shows a ladder of elongated 
products, with the darkest bands correlating with the putative 5 r boundary of the template (as 
described by Lingner et aL, Genes Develop., 8:1984 [1994]). The abundance of products that 
correspond to other positions in the template suggested that pausing and/or dissociation 
occurs at sites other than the site of translocation with the purified telomerase. 

As shown in Figure 31, double-stranded, blunt-ended oligonucleotides were 
not substrates for telomerase. To determine whether these molecules would bind to 
telomerase, a competition experiment was performed. In this experiment, 2 nM of 5 ! -end 
labeled substrate with the sequence (G 4 T 4 ) 2 , or a hairpin substrate with a six base overhang 
were extended with 0.125 nM telomerase (Figure 31, lanes 25-27). Although the same 
unlabeled oligonucleotide substrates competed efficiently with labeled substrate for 
extension, no reduction of activity was observed when the double-stranded blunt-ended 
hairpin oligonucleotides were used as competitors, even in the presence of 100-fold excess 
hairpins. 

These results indicated that double-stranded, blunt-ended oligonucleotides 



208 



cannot bind to telomerase at the concentrations and conditions tested in this Example. 
Rather, a single-stranded 3' end is required for binding. It is likely that this 3' end is required 
to base pair with the telomerase RNA template. 

K. Cloning & Sequencing of the 123 kDa Polypeptide 

In this Example, the cloning of the 123 kDa polypeptide of Euplotes 
telomerase (i.e., the 123 kDa protein subunit) is described. In this study, an internal fragment 
of the telomerase gene was amplified by PCR, with oligonucleotide primers designed to 
match peptide sequences that were obtained from the purified polypeptide obtained in Part D, 
above. The polypeptide sequence was determined using the nanoES tandem mass 
spectroscopy methods known in the art and described by Calvio et al. 9 RNA 1 : 724-733 
[1995]. The oligonucleotide primers used in this Example had the following sequences, with 
positions that were degenerate shown in parentheses--5'-TCT(G/A) 
AA(G/A)TA(G/A)TG(T/G/A)GT(G/A/T/C)A(T/G/A)(G/A)TT(G/A)TTCAT-3', and 

S'-GCGGATCCAT^^ 

A 50 [il reaction contained 0.2 mM dNTPs, 0.15 jxg E. aediculatus 
chromosomal DNA, 0.5 ^1 Taq (Boehringer-Mannheim), 0.8 ^g of each primer, and Ix 
reaction buffer (Boehringer-Mannheim). The reaction was incubated in a thermocycler 
(Perkin-Elmer), using the following--5 minutes at 95 °C, followed by 30 cycles of 1 minute at 
94°C, 1 minute at 52°C, and 2 minutes at 72°C. The reaction was completed by a 10 minute 
incubation at 72 °C. 

A genomic DNA library was prepared from the chromosomal E. aediculatus 
DNA by cloning blunt-ended DNA into the Smal site of pCR-Script plasmid vector 
Figure 14(Stratagene). This library was screened by colony hybridization, with the 
radiolabeled, gel-purified PCR product. Plasmid DNA of positive clones was prepared and 
sequenced by the dideoxy method (Sanger et aU Proc. Natl. Acad. ScL, 74:5463 [1977]) or 
manually, through use of an automated sequencer (ABI). The DNA sequence of the gene 
encoding this polypeptide is shown in Figure 13. The start codon in this sequence inferred 
from the DNA sequence, is located at nucleotide position 101, and the open reading frame 
ends at position 3 193. The genetic code of Euplotes differs from other organisms in that the 
"UGA" codon encodes a cysteine residue. The amino acid sequence of the polypeptide 
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inferred from the DNA sequence is shown in Figure 14, and assumes that no unusual amino 
acids are inserted during translation and no post-translational modification occurs. 

L. Cloning & Sequencing of the 43 kDa Polypeptide 

5 In this Example, the cloning of the 43 kDa polypeptide of telomerase (i.e., the 

43 kDa protein subunit) is described. In this study, an internal fragment of the corresponding 
telomerase gene was amplified by PCR, with oligonucleotide primers designed to match 
peptide sequences that were obtained from the purified polypeptide obtained in Part D, above. 
The polypeptide sequence was determined using the nanoES tandem mass spectroscopy 

1 0 methods known in the art and described by Calvio et al, supra. The oligonucleotide primers 
used in this Example had the following sequences-5*- 

NNNGTNAC(C/T/A)GG(C/T/A)AT(C/T/A)AA(C/T)AA-3* , and 5'-(T/G/A)GC 
(T/G/A)GT(C/T)TC(T/C)TG(G/A)TC(G/A)TT(G/A)TA-3'. In this sequence, "N" indicates 
the presence of any of the four nucleotides (i.e., A, T, G, or C). 
\ 5 The PCR was performed as described in Part K. 

A genomic DNA library was prepared and screened as described in Part K. 
The DNA sequence of the gene encoding this polypeptide is shown in Figure 34. Three 
potential reading frames are shown for this sequence, as shown in Figure 35. For clarity, the 
amino acid sequence is indicated below the nucleotide sequence in all three reading frames. 
20 These reading frames are designated as "a," "b," and "c". A possible start codon is encoded at 
nucleotide position 84 in reading frame "c." The coding region could end at position 1501 in 
reading frame "b." Early stop codons, indicated by asterisks in this figure, occur in all three 
reading frames between nucleotide position 337-350. 

The "La-domain" is indicated in bold-face type. Further downstream., the 
25 protein sequence appears to be encoded by different reading frames, as none of the three 
frames is uninterrupted by stop codons. Furthermore, peptide sequences from purified 
protein are encoded in all three frames. Therefore, this gene appears to contain intervening 
sequences, or in the alternative, the RNA is edited. Other possibilities include ribosomal 
frame-shifting or sequence errors. However, the homology to the La-protein sequence 
30 remains of significant interest. Again, in Euplotes, the "UGA" codon encodes a cysteine 
residue. 
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M. Amino Acid and Nucleic Acid Comparisons 

In this Example, comparisons between various reported sequences and the 
sequences of the 123 kDa and 43 kDa telomerase subunit polypeptides were made. 

i) Comparisons with the 123 kDa E. aediculatus Telomerase Subunit 

The amino acid sequence of the 123 kDa Euplotes aediculatus polypeptide 
was compared with the sequence of the 80 kDa telomerase protein subunit of Tetrahymena 
thermophila (GenBank accession #U25641) to investigate their similarity. The nucleotide 
sequence as obtained from GenBank encoding this protein is shown in Figure 42. The amino 
acid sequence of this protein as obtained from GenBank is shown in Figure 43. The sequence 
comparison between the 123 kDa E. aediculatus and 80 kDa T. thermophila is shown in 
Figure 36. In this figure, the E. aediculatus sequence is the upper sequence, while the T. 
thermophila sequence is the lower sequence. The observed identity was determined to be 
approximately 19%, while the percent similarity was approximately 45%, values similar to 
what would be observed with any random protein sequence. In Figures 36-39, identities are 
indicated by vertical bars, while single dots between the sequences indicate somewhat similar 
amino acids, and double dots between the sequences indicate more similar amino acids. 

The amino acid sequence of the 123 kDa Euplotes aediculatus polypeptide was also 
compared with the sequence of the 95 kDa telomerase protein subunit of Tetrahymena 
thermophila (GenBank accession #U25642), to investigate their similarity. The nucleotide 
sequence as obtained from GenBank encoding this protein is shown in Figure 44. The amino 
acid sequence of this protein as obtained from GenBank is shown in Figure 45. This 
sequence comparison is shown in Figure 37. In this figure, the E aediculatus sequence is the 
upper sequence), while the T. thermophila sequence is the lower sequence. The observed 
identity was determined to be approximately 20%, while the percent similarity was 
approximately 43%, values similar to what would be observed with any random protein 
sequence. 

Significantly, the amino acid sequence of the 123 kDa E. aediculatus 
polypeptide contains the five motifs characteristic of reverse transcriptases. The 123 kDa 
polypeptide was also compared with the polymerase domains of various reverse 
transcriptases. Figure 40 shows the alignment of the 123 kDa polypeptide with the putative 
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yeast homolog (L8543.12 or ESTp). The amino acid sequence of L8543.12 obtained from 
GenBank is shown in Figure 46. 

Four motifs (A, B, C, and D) were included in this comparison. In this Figure 
40, highly conserved residues are indicated by white letters on a black background. Residues 
5 of the E. aediculatus sequences that are conserved in the other sequence are indicated in bold; 
the "h" indicates the presence of a hydrophobic amino acid. The numerals located between 
amino acid residues of the motifs indicates the length of gaps in the sequences. For example, 
the "100" shown between motifs A and B reflects a 100 amino acid gap in the sequence 
between the motifs. 

10 As noted above, Genbank searches identified a yeast protein (Genbank 

accession #u20618), and gene L8543.12 (Est2) containing or encoding amino acid sequence 
that shows some homology to the E. aediculatus 123 kDa telomerase subunit Based on the 
observations that both proteins contain reverse transcriptase motifs in their C-terminal 
regions; both proteins share similarity in regions outside the reverse transcriptase motif; the 

15 proteins are similarly basic (pi = 10.1 for E. aediculatus and pl=10.0 for the yeast); and both 
proteins are large (123 kDa for E. aediculatus and 103 kDa for the yeast), these sequences 
comprise the catalytic core of their respective telomerases. It was contemplated based on this 
observation of homology in two phylogenetically distinct organisms as E. aediculatus and 
yeast, that human telomerase would contain a protein that has the same characteristics {i.e., 

20 reverse transcriptase motifs, is basic, and large [> 100 kDa]). 

ii) Comparisons with the 43 kDa E. aediculatus Telomerase Subunit 

The amino acid sequence of the M La-domain M of the 43 kDa Euplotes 
aediculatus polypeptide was compared with the sequence of the 95 kDa telomerase protein 

25 subunit of Tetrahymena thermophila (described above) to investigate their similarity. This 
sequence comparison is shown in Figure 38, while the T. thermophila sequence is the lower 
sequence. The observed identity was determined to be approximately 23%, while the percent 
similarity was approximately 46%, values similar to what would be observed with any 
random protein sequence. 

30 The amino acid sequence of the "La-domain" of the 43 kDa Euplotes 

aediculatus polypeptide was compared with the sequence of the 80 kDa telomerase protein 
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subunit of Tetrahymena thermophila (described above) to investigate their similarity. This 
sequence comparison is shown in Figure 39. In this figure, the E, aediculatus sequence is the 
upper sequence, while the 71 thermophila sequence is the lower sequence. The observed 
identity was determined to be approximately 26%, while the percent similarity was 
5 approximately 49%, values similar to what would be observed with any random protein 
sequence. 

The amino acid sequence of a domain of the 43 kDa E. aediculatus 
polypeptide was also compared with La proteins from various other organisms. These 
comparisons are shown in Figure 41 . In this Figure, highly conserved residues are indicated 
10 by white letters on a black background. Residues of the E. aediculatus sequences that are 
conserved in the other sequence are indicated in bold. 

N. Identification of Telomerase Protein Subunits in Another Organism 

In this Example, the sequences identified in the previous Examples above were 
15 used to identify the telomerase protein subunits of Oxytricha trifallax, a ciliate that is very 

distantly related to E. aediculatus. Primers were chosen based on the conserved region of the 
E. aediculatus 123 kDa polypeptide which comprised the reverse transcriptase domain 
motifs. Suitable primers were synthesized and used in a PCR reaction with total DNA from 
Oxytricha. The Oxytricha DNA was prepared according to methods known in the art. The 
20 PCR products were then cloned and sequenced using methods known in the art. 

The oligonucleotide sequences used as the primers were as follows: 
5'-(T/C)A(A/G)A^ 

GG-3' and 5 , -(G/A/T)GT(G/A/T)ATNA(G/A)NA(G/A)(G/A)TA(G/A)TC(G/A)TC.3 t ). 
Positions that were degenerate are shown in parentheses, with the alternative bases shown 

25 within the parenthesis. "N" represents any of the four nucleotides. 

In the PCR reaction, a 50 |il reaction contained 0.2 mM dNTPs, 0.3 jig 
Oxytricha trifallax chromosomal DNA, 1 \i\ Tag polymerase (Boehringer-Mannheim), 2 
micromolar of each primer, lx reaction buffer (Boehringer-Mannheim). The reaction was 
incubated in a thermocycler (Perkin-Elmer) under the following conditions: 5 min at 95 °C, 

30 30 cycles consisting of 1 min at 94°C, 1 min at 53 °C, and 1 min at 72 °C, followed by a 10 
min incubation at 72°C. The PCR-product was gel-purified and sequenced by the 
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dideoxy-method (e.g., Sanger et al, Proc. Natl. Acad. Sci. 74, 5463-5467 (1977). 

The deduced amino acid sequence of the PCR product was determined and 
compared with the E. aediculatus sequence. Figure 47 shows the alignment of these 
sequences, with the O, trifallax sequence shown in the top row, and the £. aediculatus 
5 sequence shown in the bottom row. As can be seen from this figure, there is a great deal of 
homology between the O. trifallax polypeptide sequence identified in this Example with the 
E. aediculatus polypeptide sequence. Thus, it is clear that the sequences identified in the 
present invention are useful for the identification of homologous telomerase protein subunits 
in other eukaryotic organisms. Indeed, development of the present invention has identified 
10 homologous telomerase sequences in multiple, diverse species, as described herein. 

O. Identification of Tetrahymena Telomerase Sequences 

In this Example, a Tetrahymena clone was produced that shares homology 
with the Euplotes sequences, and EST2p. 

1 5 This experiment utilized PCR with degenerate oligonucleotide primers 

directed against conserved motifs to identify regions of homology between Tetrahymena, 
Euplotes, and EST2p sequences. The PCR method used in this Example is a novel method 
designed to amplify specifically rare DNA sequences from complex mixtures. This method 
avoids the problem of amplification of DNA products with the same PCR primer at both ends 

20 {i.e., single primer products) commonly encountered in PCR cloning methods. These single 
primer products produce unwanted background and can often obscure the amplification and 
detection of the desired two-primer product. The method used in this experiment 
preferentially selects for two-primer products. In particular, one primer is biotinylated and 
the other is not. After several rounds of PCR amplification, the products are purified using 

25 streptavidin magnetic beads and two primer products are specifically eluted using heat 

denaturation. This method finds use in settings other than the experiments described in this 
Example. Indeed, this method finds use in application in which it is desired to specifically 
amplify rare DNA sequences, including the preliminary steps in cloning methods such as 5 f 
and 3; RACE, and any method that uses degenerate primers in PCR. 

30 A first PCR run was conducted using Tetrahymena template macronuclear 

DNA isolated using methods known in the art, and the 24-mer forward primer with the 
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sequence 5* biotin-GCCTATTT(TC)TT(TC)TA(TC)(GATC)(GATC) 
(GATC)AC(GATC)GA-3 ' designated as "K231," corresponding to the FFYXTE region, and 
the 23-mer reverse primer with the sequence 5'- CCAGATAT(GATC)A 
(TGA)(GATC)A(AG)(AG)AA(AG)TC(AG)TC- 3% designated as "K220," corresponding to 
5 the DDFL(FIL)I region. This PCR reaction contained 2.5 nl DNA (50 ng), 4 \xl of each 
primer (20 jiM), 3 \xl lOx PCR buffer, 3 jxl lOx dNTPs, 2 jil Mg, 0.3 \x\ Tag, and 1 1.2 |il 
dH 2 0. The mixture was cycled for 8 cycles of 94 °C for 45 seconds, 37 °C for 45 seconds, 
and 72 °C for 1 minute. 

This PCR reaction was bound to 200 |il streptavidin magnetic beads, washed 

10 with 200 ^1 TE, resuspended in 20 \xl dH 2 0 and then heat-denatured by boiling at 100° C for 
2 minutes. The beads were pulled down and the eluate removed. Then, 2.5 \xl of this eluate 
was subsequently reamplified using the above conditions, with the exception being that 0.3 \il 
of a - 32 P dATP was included, and the PCR was carried out for 33 cycles. This reaction was 
run a 5% denaturing polyacrylamide gel, and the appropriate region was cut out of the gel. 

1 5 These products were then reamplified for an additional 34 cycles, under the conditions listed 
above, with the exception being that a 42 °C annealing temperature was used. 

A second PCR run was conducted using Tetrahymena macronuclear DNA 
template isolated using methods known in the art, and the 23-mer forward primer with the 
sequence 5'- ACAATG(CA)G(GATC)(TCA)T(GATC)(TCA)T(GATC)CC 

20 (G ATC) AA(AG) AA-3 ' , designated as "K228," corresponding to the region R(LI)(LI)PKK , 
and a reverse primer with the sequence 5 T -ACGAATC(GT)(GATC)GG 
(TAG)AT(GATC)(GC)(TA)(AG)TC(AG)TA(AG)CA 3' , designated M K224," corresponding 
to the CYDSIPR region. This PCR reaction contained 2.5 \il DNA (50 ng), 4 \i\ of each 
primer (20 ^M), 3 |xl lOx PCR buffer, 3 yX lOx dNTPs, 2 \x\ Mg, 0.3 \il a - 32 P dATP, 0.3 |al 

25 Taq, and 10.9 |il dH 2 0. This reaction was run on a 5% denaturing polyacrylamide gel, and 
the appropriate region was cut out of the gel. These products were reamplified for an 
additional 34 cycles, under the conditions listed above, with the exception being that a 42 °C 
annealing temperature was used. 

Ten \xl of the reaction product from run 1 were bound to streptavidin-coated 

30 magnetic beads in 200 ^il TE. The beads were washed with 200 jal TE, and then resuspended 
in 20 \il of dH 2 Q, heat denatured, and the eluate was removed. The reaction product from run 
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2 was then added to the beads and diluted with 30 ul 0.5x SSC. The mixture was heated from 
94°C to 50°C. The eluate was removed and the beads were washed three times in 0.5x SSC 
at 55 °C. The beads were then resuspended in 20 ul dH 2 0, heat denatured, and the eluate was 
removed, designated as "round 1 eluate" and saved. 
5 To isolate the Tetrahymena band, the round 1 eluate was reamplified with the 

forward primer K228 and reverse primer K227 with the sequence 
5'- CAATTCTC(AG)TA(AG)CA(GATC)(CG)(TA)(CT)TT(AGT)AT(GA)TC-3' , 
corresponding to the DIKSCYD region. The PCR reactions were conducted as described 
above. The reaction products were run on a 5% polyacrylamide gel; the band corresponding 
10 to approximately 295 nucleotides was cut from the gel and sequenced. 

The clone designated as 168-3 was sequenced. The DNA sequence (including 
the primer sequences) was found to be: 

GATTACTCCCGAAGAAAGGATCTTTCCGTCCAATCATGACTTTCTTAAGAAAGGA 

CAAGCAAAAAAATATTAAGTTAAATCTAAATTAAATTCTAATGGATAGCCAACTT 
1 5 GTGTTT AGGAATTT AAAAGAC ATGCTGGGATAAAAGATAGGATACTC AGTCTTTG 

ATAATAAACAAATTTCAGAAAAATTTGCCTAATTCATAGAGAAATGGAAAAATA 

AAGGAAGACCTCAGCTATATTATGTCACTCTAGACATAAAGACTTGCTAC. 

Additional sequence of this gene was obtained by PCR using one unique 

primer designed to match the sequence from 168-3 ("K297" with the sequence 
20 5'-GAGTGACATAATATACGTGA-3'; and the K23 1 (FFYXTE) primer. The sequence of 

the fragment obtained from this reaction, together with 168-3 is as follows (without the 

primer sequences): 

AAACACAAGGAAGGAAGTCAAATATTCTATTACCGTAAACCAATATGGAAATTA 
GTGAGTAAATTAACTATTGTCAAAGTA^GAATTTAGTTTTCTGAAAAGAATAAAT 

25 AAATGAAAAATAATTTTTATCAAAAAATTTAGCTTGAAGAGGAGAATTTGGAAA 
AAGTTGAAGAAAAATTGATACCAGAAGATTCATTTTAGAAATACCCTCAAGGAA 
AGCTAAGGATTATACCTAAAAAAGGATCTTTCCGTCCAATCATGACTTTCTTAAG 
AAAGGACAAGCAAAAAAATATTAAGTTAAATCTAAATTAAATTCTAATGGATAG 
CCAACTTGTGTTTAGGAATTTAAAAGACATGCTGGGATAAAAGATAGGATACTCA 

30 GTCTTTGATAATAAACAAATTTCAGAAAAATTTGCCTAATTCATAGAGAAATGGA 
AAAATAAAGGAAGACCTCAGCTATATTATGTCACTCTA. 
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The amino acid sequence corresponding to this DNA fragment was found to 

be: 

KHKEGSQIFYYRKPIWKLVSK^ 
KLIPEDSFQKYPQGKLRIIPKKGSF 
5 DMLGQKIGYSVFDNKQISEKFAQFIEKWKNKGRPQLYYVTL. 

This amino acid sequence was then aligned with other telomerase genes 
(EST2p, and Euplotes). The alignment is shown in Figure 53. A consensus sequence is also 
shown in this Figure. 

10 P. Identification of Schizosaccharomyces pombe Telomerase Sequences 

In this Example, the tezl sequence of S. pombe was identified as a homolog of 
the E. aediculatus pi 23, and S. cerevisiae Est2p. 

Figure 55 provides an overall summary of these experiments. In this Figure, 
the top portion (Panel A) shows the relationship of two overlapping genomic clones, and the 
15 5825 bp portion that was sequenced. The region designated at '7ez/ + " is the protein coding 
region, with the flanking sequences indicated as well, the box underneath the 5825 bp region 
is an approximately 2 kb Zfi'wdlll fragment that was used to make the tezl disruption 
construct, as described below. 

The bottom half of Figure 55 (Panel B) is a "close-up" schematic of this same 
20 region of DNA. The sequence designated as "original PCR" is the original degenerate PCR 
fragment that was generated with a degenerate oligonucleotide primer pair designed based on 
Euplotes sequence motif 4 (B ! ) and motif 5 (C), as described. 

i) PCR With Degenerate Primers 

25 PCR using degenerate primers was used to find the homolog of the E. 

aediculatus pi 23 in S. pombe. Figure 56 shows the sequences of the degenerate primers 
(designated as "poly 4" and "poly 1") used in this reaction. The PCR runs were conducted 
using the same buffer as described in previous Examples (See e.g., Part K, above), with a 5 
minute ramp time at 94°C, followed by 30 cycles of 94°C for 30 seconds, 50°C for 45 

30 seconds, and 72°C for 30 seconds, and 7 minutes at 72°C, followed by storage at 4°C. PCR 
runs were conducted using varied conditions, (i.e., various concentrations of & pombe DNA 
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and MgCl 2 concentrations). The PCR products were run on agarose gels and stained with 
ethidium bromide as described above. Several PCR runs resulted in the production of three 
bands (designated as "T," "M," and "B"). These bands were re-amplified and run on gels 
using the same conditions as described above. Four bands were observed following this re- 
5 amplification ("T," "Ml," "M2," and "B"), as shown in Figure 57. These four bands were 
then re-amplified using the same conditions as described above. The third band from the top 
of the lane in Figure 57 was identified as containing the correct sequence for a telomerase 
protein. The PCR product designated as M2 was found to show a reasonable match with 
other telomerase proteins, as indicated in Figure 58. In addition to the alignment shown, this 
1 0 Figure also shows the actual sequence of tezl. In this Figure, the asterisks indicate residues 
shared with all four sequences (Oxytricha "Ot"; E. aediculatus "Ea_pl23"; S. cerevisiae 
"Sc _pl03"; and M2), while the circles (i.e., dots) indicate similar amino acid residues. 



on the telomerase candidate identified in Figure 58. Figure 59 provides a schematic of the 3* 
RT PCR strategy used. First, cDNA was prepared from mRNA using the oligonucleotide 
primer "Q T ," (S'-CCA GTG AGC AGA GTG ACG AGG ACT CGA GCT CAA GCT TTT 
TTT TTT TTT TT-3'), then using this cDNA as a template for PCR with "Q 0 " (5'-CCA GTG 

20 AGC AGA GTG ACG-3'), and a primer designed based on the original degenerated PCR 
reaction (z.e., "M2-T" with the sequence 5'-G TGT CAT TTC TAT ATG GAA GAT TTG 
ATT GAT G-3'). The second PCR reaction (i.e., nested PCR) with "Qj" (5 r -GAG GAC TCG 
AGC TCA AGC-3'), and another PCR primer designed with sequence derived from the 
original degenerate PCR reaction or "M2-T2" (5'-AC CTA TCG TTT ACG AAA AAG AAA 

25 GGA TCA GTG-3 1 ). The buffers used in this PCR were the same as described above, with 

amplification conducted beginning with a ramp up of 94° for 5 min, followed by 30 cycles of 
94° for 30 sec, 55°C for 30 sec, and 72°C for 3 min, followed by 7 minutes at 72°C. The 
reaction products were stored at 4°C until use. 

30 iii) Screening of Genomic and cDNA Libraries 



15 



ii) 3 f RT PCR 

To obtain additional sequence information. 



l, 3' and 5 1 RT PCR were conducted 



After obtaining this additional sequence information, several genomic and 
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cDNA libraries were screened to identify any libraries that contain this telomerase candidate 
gene. The approach used, as well as the libraries and results are shown in Figure 60. In this 
Figure, Panel A lists the libraries tested in this experiment; Panel B shows the regions used; 
Panels C and D show the dot blot hybridization results obtained with these libraries. Positive 

5 libraries were then screened by colony hybridization to obtain genomic and cDNA version of 
tezl gene. In this experiment, approximately 3 x 10 4 colonies from the Hindlll genomic 
library were screened and six positive clones were identified (approximately 0.01%). DNA 
was then prepared from two independent clones (A5 and B2). Figure 61 shows the results 
obtained with the i/mdIII-digested A5 and B2 positive genomic clones. 

10 In addition, cDNA REP libraries were used. Approximately 3 x 10 5 colonies 

were screened, and 5 positive clones were identified (0.002%). DNA was prepared from 
three independent clones (2-3, 4-1, and 5-20). In later experiments, it was determined that 
clones 2-3 and 5-20 contained identical inserts. 



15 iv)5'RTPCR 

As the cDNA version of gene produced to this point was not complete, 5' 
RT-PCR was conducted to obtain a full length clone. The strategy is schematically shown in 
Figure 62. In this experiment, cDNA was prepared using DNA oligonucleotide primer "M2- 
B" (5'-CAC TGA TCC TTT CTT TTT CGT AAA CGA TAG GT-3') and "M2-B2 H (5'-C 

20 ATC AAT CAA ATC TTC CAT ATA GAA ATG ACA-3'), designed from known regions of 
tezl identified previously. An oligonucleotide linker PCR Adapt Sfil with a phosphorylated 
5 f end ("P") (P-GGG CCG TGT TGG CCT AGT TCT CTG CTC-3 f ; was then ligated at the 3' 
end of this cDNA, and this construct was used as the template for nested PCR. In the first 
round of PCR, PCR Adapt SFI and M2-B were used as the primers; while PCR Adapt Sfill 

25 (5-GAG GAG GAG AAG AGC AGA GAA CTA GGC CAA CAC GCC CC-3'), and M2-B2 
were used as primers in the second round. Nested PCR was used to increase specificity of 
reaction. 



v) Sequence Alignments 

30 Once the sequence of tezl was identified, it was compared with sequences 

previously described. Figure 63 shows the alignment of RT domains from telomerase 
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catalytic subunits of S. pombe ("S.p. Tezlp"), S. cerevisiae ( M S.c. Est2p n ), and E. aediculatus 
pl23 ("E.a. pl23"). In this Figure, n h t1 indicates hydrophobic residues, while "p" indicates 
small polar residues, and "c M indicates charged residues. The amino acid residues indicated 
above the alignment show a known consensus RT motif of Y. Xiong and T.H. Eickbush (Y. 
Xiong and T.H. Eickbush, EMBO J., 9: 3353-3362 [1990]). The asterisks indicate the 
residues that are conserved for all three proteins. "Motif O" is identified herein and in Figure 
63 as a motif specific to this telomerase subunit and not found in reverse transcriptases in 
general. It is therefore valuable in identifying other amino acid sequences as telomerase 

catalytic subunits. 

Figure 64 shows the alignment of entire sequences from Euplotes 
("Ea_pl23"), S. cerevisiae ( M Sc_Est2p"), and S. pombe ("Sp_Tezlp"). In Panel A, the shaded 
areas indicate residues shared between two sequences. In Panel B, the shaded areas indicate 
residues shared between all three sequences. 

vi) Genetic Disruption of tezl 

In this Example, the effects of disruption of tezl were investigated. As 
telomerase is involved in telomere maintenance, it was hypothesized that if tezl were indeed 
a telomerase component, disruption of tezl would cause gradual telomere shortening. 

In these experiments, homologous recombination was used to disrupt the tezl 
gene in S. pombe specifically. This approach is schematically illustrated in Figure 65. As 
indicated in Figure 65, wild type tezl was replaced with a fragment containing the ura4 or 
LEU2 marker. 

The disruption of tezl gene was confirmed by PCR (Figure 66), and a 
Southern blot was performed to check for telomere length. Figure 67 shows the Southern 
blot results for this experiment. Because an Apal restriction enzyme site is present 
immediately adjacent to telomeric sequence in S. pombe, Apal digestion of S. pombe genomic 
DNA preparations permits analysis of telomere length. Thus, DNA from S. pombe was 
digested with Apal and the digestion products were run on an agarose gel and probed with a 
telomeric sequence-specific probe to determine whether the telomeres of disrupted S. pombe 
cells were shortened. The results are shown in Figure 67. From these results, it was clear 
that disruption of the tezl gene caused a shortening of the telomeres. 
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11 # 

Q. Cloning and Characterization of Human Telomerase Protein and cDNA 

In this Example, the nucleic and amino acid sequence information for human 
telomerase was determined. Partial homologous sequences were first identified in a BLAST 
search conducted using the Euplotes 123 kDa peptide and nucleic acid sequences, as well as 
Schizosaccharomyces protein and corresponding cDNA (tezl) sequences. The human 
sequences (also referred to as "hTCPl.l") were identified from a partial cDNA clone (clone 
712562). Sequences from this clone were aligned with the sequences determined as described 
in previous Examples. 

Figure 1 shows the sequence alignment of the Euplotes ("pi 23"), 
Schizosaccharomyces ("tezl"), Est2p (i.e. 9 the S. cerevisiae protein encoded by the Est2 
nucleic acid sequence, and also referred to herein as "L8543.12"), and the human homolog 
identified in this comparison search. Figure 51 shows the amino acid sequence of tezl, while 
Figure 52 shows the DNA sequence of tezl. In Figure 52, the introns and other non-coding 
regions, are shown in lower case, while the exons (i.e., coding regions) are shown in upper 
case. 

As shown in the Figures, there are regions that are highly conserved among 
these proteins. For example, as shown in Figure 1, there are regions of identity in "Motif 0," 
"Motif 1, "Motif 2," and "Motif 3." The identical amino acids are indicated with an asterisk 
(*), while the similar amino acid residues are indicated by a circle (•). This indicates that 
there are regions within the telomerase motifs that are conserved among a wide variety of 
eukaryotes, ranging from yeast to ciliates to humans. It is contemplated that additional 
organisms will likewise contain such conserved regions of sequence. Figure 49 shows the 
partial amino acid sequence of the human telomerase motifs, while Figure 50 shows the 
corresponding DNA sequence. 

Sanger dideoxy sequencing and other methods were used, as known in the art 
to obtain complete sequence information of clone 712562. Some of the primers used in the 
sequencing are shown in Table 7. These primers were designed to hybridize to the clone), 
based on sequence complementarity to either plasmid backbone sequence or the sequence of 
the human cDNA insert in the clone. 
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Table 7. Primers 



5 



10 
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Primer 


Sequence 


TCP1.1 


GTGAAGGCACTGTTCAGCG 


TCP 1.2 


GTGGATGATTTCTTGTTGG 


TCP 1.3 


ATGCTCCTGCGTTTGGTGG 


TCP1.4 


CTGGACACTCAGCCCTTGG 


TCP1.5 


GGCAGGTGTGCTGGACACT 


TCP 1.6 


TTTGATGATGCTGGCGATG 


TCP 1.7 


GGGGCTCGTCTTCTACAGG 


TCP1.8 


CAGCAGGAGGATCTTGTAG 


TCP 1.9 


TGACCCCAGGAGTGGCACG 


TCP1.10 


TCAAGCTGACTCGACACCG 


TCP1.11 


CGGCGTGACAGGGCTGC 


TCP1.12 


GCTGAAGGCTGAGTGTCC 


TCP1.13 


TAGTCCATGTTCACAATCG 



From these experiments, it was determined that the EcoRI-NotI insert of clone 
712562 contains only a partial open reading frame for the human telomerase protein, although 

20 it may encode an active fragment of that protein. The open reading frame in the clone 

encodes an approximately 63 kD protein. The sequence of the longest open reading frame 
identified is shown in Figure 68. The ORF begins at the ATG codon with the "met" indicated 
in the Figure, The poly A tail at the 3' end of the sequence is also shown. Figure 69 shows a 
tentative, preliminary alignment of telomerase reverse transcriptase proteins from the human 

25 sequence (human Telomerase Core Protein 1, "Hs TCP1"), E. aediculatus pl23 ("Ep pl23), 
S. pombe tezl ("Sp TezP), S. cerevisiae EST2 (Sc Est2 M ), and consensus sequence. In this 
Figure various motifs are indicated. 

To obtain a full-length clone, probing of a cDNA library and 5 '-RACE were 
used to obtain clones encoding portions of the previously uncloned regions. In these 

30 experiments, RACE (Rapid Amplification of cDNA Ends; See e.g., MA. Frohman, "RACE: 
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Rapid Amplification of cDNA Ends," in Innis et al (eds), PGR Protocols: A Guide to 
Methods and Applications [1990], pp. 28-38; and Frohman et aL, Proc. NatL Acad, ScL, 
85:8998-9002 [1988]) was used to generate material for sequence analysis. Four such clones 
were generated and used to provide additional 5 f sequence information (pFWRPS, 6, 19, and 
20). 

In addition, human cDNA libraries (inserted into lambda) were probed with 
the EcoRI-NotI fragment of the clone. One lambda clone, designated "lambda 25-1.1" 
(ATCC accession #209024), was identified as containing complementary sequences. Figure 
75 shows a restriction map of this lambda clone. The human cDNA insert from this clone 
was subcloned as an EcoKL restriction fragment into the EcoKl site of commercially available 
phagemid pBluescriptIISK+ (Stratagene), to create the plasmid "pGRN121, M which was 
deposited with the ATCC (ATCC accession #209016). Preliminary results indicated that 
plasmid pGRN121 contains the entire open reading frame (ORF) sequence encoding the 
human telomerase protein. 

The cDNA insert of plasmid pGRN121 was sequenced using techniques 
known in the art. Figure 70 provides a restriction site and function map of plasmid pGRN121 
identified based on this preliminary work. The results of this preliminary sequence analysis 
are shown in Figure 71. From this analysis, and as shown in Figure 70, a putative start site 
for the coding region was identified at approximately 50 nucleotides from the EcoKL site 
(located at position 707), and the location of the telomerase-specific motifs, "FFYVTE" , 
"PKP," " AYD," "QG", and "DD," were identified, in addition to a putative stop site at 
nucleotide #3571 (See, Figure 72, which shows the DNA and corresponding amino acid 
sequences for the open reading frames in the sequence ("a", "b", and "c"). However, due to 
the preliminary nature of the early sequencing work, the reading frames for the various motifs 
were found not to be in alignment. 

Additional analysis conducted on the pGRN121 indicated that the plasmid 
contained significant portions from the 5'-end of the coding sequence not present on clone 
712562. Furthermore, pGRN121 was found to contain a variant coding sequence that 
includes an insert of approximately 182 nucleotides. This insert was found to be absent from 
the clone. As with the E. aediculatus sequences, such variants can be tested in functional 
assays, such as telomerase assays to detect the presence of functional telomerase in a sample. 
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Further sequence analysis resolved the cDNA sequence of pGRN121 to 
provide a contiguous open reading frame that encodes a protein of molecular weight of 
approximately 127,000 daltons, and 1 132 amino acids as shown in Figure 74. A refined map 
of pGRN121 based on this analysis, is provided in Figure 73. The results of additional 
5 sequence analysis of the hTRT cDNA are presented in Figure 1 6 (SEQUENCE ID NO: 1). 

EXAMPLE 2 

CORRELATION OF hTRT ABUNDANCE AND CELL IMMORTALIT Y 

The relative abundance of hTRT mRNA was assessed in six telomerase- 

10 negative mortal cell strains and six telomerase-positive immortal cell lines (Figure 5). The 
steady state level of hTRT mRNA was significantly increased in immortal cell lines that had 
previously been shown to have active telomerase. Lower levels of the hTRT mRNA were 
detected in some telomerase-negative cell strains. 

RT-PCR for hTRT, hTR, TP1 (telomerase-associated protein related to 

1 5 Tetrahymena p80 [Harrington et al., 1 997, Science 275 :973 ; Nakayama et al., 1 997, Cell 

88:875]) and GAPDH (to normalize for equal amounts of RNA template) was carried out on 
RNA derived from the following cells: (1) human fetal lung fibroblasts GFL, (2) human fetal 
skin fibroblasts GFS, (3) adult prostate stromal fibroblasts 31 YO, (4) human fetal knee 
synovial fibroblasts HSF, (5) neonatal foreskin fibroblasts BJ, (6) human fetal lung 

20 fibroblasts IMR90, and immortalized cell lines: (7) melanoma LOX IMVI, (8) leukemia 
U251, (9) NCI H23 lung carcinoma, (10) colon adenocarcinoma SW620, (1 1) breast tumor 
MCF7, (12) 293 adenovirus El transformed human embryonic kidney cell line. 

hTRT nucleic acid was amplified from cDNA using oligonucleotide primers 
LT5 and LT6 (Table 2) for a total of 3 1 cycles (94°C 45s, 60°C 45s, 72°C 90s). GAPDH 

25 was amplified using primers KI36 (5'-CTCAGACACCATGGGGAA 

GGTGA) and K137 (5'-ATGATCTTGAGGCTGTTGTCATA) for a total of 16 cycles (94°C 
45 s, 55 °C 45 s, 72 °C 90 s). hTR was amplified using primers F3b (5'-TCTAA 
CCCTAACTGAGAAGGGCGTAG) and R3c (5^GTTTGCTCTAGAATGAACGGTG 
GAAG) for a total of 22 cycles (94 °C 45s, 55 °C 45 s, 72 °C 90s). TP1 mRNA weis 

3 0 amplified using primers TP 1 . 1 and TP 1 .2 for 28 cycles (cycles the same as hTRT). Reaction 
products were resolved on an 8% polyacrylamide gel, stained with SYBR Green (Molecular 
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Probes) and visualized by scanning on a Storm 860 (Molecular Dynamics). The results, 
shown in Figure 5, demonstrate that hTRT mRNA levels correlate directly with telomerase 
activity levels in the cells tested. 

EXAMPLE 3 

CHARACTERIZATION OF AN hTRT INTR ONIC SEQUENCE 

A putative intron was first identified by PCR amplification of human genomic 
DNA, as described in this example, and subsequently confirmed by sequencing the genomic 
clone kG$5 (see Example 4). PCR amplification was carried out using the forward primer 
TCP1.57 paired individually with the reverse primers TCP1.46, TCP1.48, TCP1.50, 
TCP1.52, TCP1.54, TCP1.56, and TCP1.58 (see Table 2). The products from genomic DNA 
of the TCP1.57/TCP1.46, TCP1.48, TCP1.50, TCP1.52, TCP1.54, or TCP1.56 amplifications 
were approximately 100 basepairs larger than the products of the pGRN121 amplifications. 
The TCP1.57/TCP1.58 amplification was the same on either genomic or pGRN121 DNA. 
This indicated .the genomic DNA contained an insertion between the sites for TCP 1.5 8 and 
TCP1.50. The PCR products of TCP1.57ATCP1.50 and TCP1.57/TCP1.52 were sequenced 
directly, without subcloning, using the primers TCP 1.39, TCP 1.57, and TCP 1.49. 

As shown below, the 104-base intronic sequence (SEQUENCE ID NO: 7) is 
inserted in the hTRT mRNA (shown in bold) at the junction corresponding to bases 274 and 
275 of Figure 16: 

CCCCCCGCCGCCCCCTCCTTCCGCCAG/GTGGGCCTCCCCGGGGTCGGCGTCCG 

GCTGGGGTTGAGGGCGGCCGGGGGGAACCAGCGACATGCGGAGAGCAGCGCAG 

GCGACTCAGGGCGCTTCCCCCGCAG/GTGTCCTGCCTGAAGGAGCTGGTGGCC 

CGAGTGCTGCAG 

The "/" indicates the splice junctions; the sequence shows good matches to consensus 5' and 
3' splice site sequences typical for human introns. 

This intron contains motifs characteristic of a topoisomerase II cleavage site 
and a NFkB binding site (see Figure 21). These motifs are of interest, in part, because 
expression of topoisomerase II is up regulated in most tumors. It functions to relax DNA by 
cutting and rewinding the DNA, thus increasing expression of particular genes. Inhibitors of 
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topoisomerase II have been shown to work as anti-tumor agents. In the case of NFkB, this 
transcription factor may play a role in regulation of telomerase during terminal 
differentiation, such as in early repression of telomerase during development and so is 
another target for therapeutic intervention to regulate telomerase activity in cells. 

5 

EXAMPLE 4 

CLONING OF LAMBDA PHAGE G$5 AND CHARACTERIZATION OF hTRT 

GENOMIC SEQUENCES 

10 A. Lambda G$5 

A human genomic DNA library was screened by PCR and hybridization to 
identify a genomic clone containing hTRT RNA coding sequences. The library was a human 
fibroblast genomic library made using DNA from WI38 lung fibroblast cells (Stratagene, Cat 
# 946204). In this library, partial Sau3AI fragments are ligated into the Xhol site of Lambda 

1 5 FIX®II Vector (Stratagene), with an insert size of 9-22 kb. 

The genomic library was divided into pools of 150,000 phage each, and each 
pool screened by nested PCR (outer primer pair TCP1.52 & TCP1.57; inner pair TCP1.49 & 
TCP1.50, see Table 1). These primer pairs span a putative intron (see Example 3, supra) in 
the genomic DNA of hTRT and ensured the PCR product was derived from a genomic source 

20 and not from contamination by the hTRT cDNA clone. Positive pools were further 

subdivided until a pool of 2000 phage was obtained. This pool was plated at low density and 
screened via hybridization with a DNA fragment encompassing basepairs 1552-2108 of 
Figure 16 (restriction sites SphI and EcoRV, respectively). 

Two positive clones were isolated and rescreened via nested PCR as described 

25 above; both clones were positive by PCR. One of the clones (A,G$5) was digested with NotI, 
revealing an insert size of approximately 20 kb. Subsequent mapping (see below) indicated 
the insert size was 15 kb and that phage G$5 contains approximately 13 kb of DNA upstream 
from the start site of the cDNA sequence. 

Phage G$5 was mapped by restriction enzyme digestion and DNA 

30 sequencing. The resulting map is shown in Figure 7. The phage DNA was digested with 
Ncol and the fragments cloned into pBBSl 67. The resulting subclones were screened by 
PCR to identify those containing sequence corresponding to the 5' region of the hTRT cDNA. 
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A subclone (pGRN140) containing a 9 kb Ncol fragment (with hTRT gene sequence and 4-5 
kb of lambda vector sequence) was partially sequenced to determine the orientation of the 
insert. pGRN 140 was digested using Sail to remove lambda vector sequences, resulting in 
pGRN144. pGRN144 was then sequenced. The results of the sequencing are provided in 
Figure 21. The 5' end of the hTRT mRNA corresponds to base 2441 of Figure 21. As 
indicated in Figure 7, two Alu sequence elements are located 1700 base pairs upstream of the 
hTRT cDNA 5' end and provide a likely upstream limit to the promoter region of hTRT. The 
sequence also reveals an intron positioned at bases 4173 in Figure 21, 3* to the intron 
described in Example 3, supra. 

B. Additional Genomic Clones 

In addition to the genomic clone described above, two PI bacteriophage 
clones and one human BAC clone are provided as illustrative embodiments of the invention. 
PI inserts are usually 75-100 kb, and BAC inserts are usually over 100 Kb. 

The PI clones (DMPC-HFF#1-477(F6) -GS #15371 and 
DMPC-HEF#1-1103(H6) -GS #15372) were obtained by PCR screening of ahuman PI 
library derived from human foreskin fibroblast cells (Shepherd et al., 1994, PNAS USA 
91:2629) using primers TCP1.12 and UTR2 which amplify the 3' end of hTRT. These clones 
were both negative (failed to amplify) with primers that amplify the 5' end of hTRT. 

The human BAC clone (326 E 20) was obtained with a hybridization screen of 
a BAC human genomic library using an 1 143 bp Sphl/Xmnl fragment of pGRN121 (Figure 
16; bases 1552-2695) that encompasses the RT motif region. The clone is believed to include 
the 5' end of the gene. The hTRT genomic clones in this example are believed to encompass 
the entire hTRT gene. 

EXAMPLE 5 
fHROMOSOMAT, LOCATION OF hTRT GENE 
The hTRT gene was localized to chromosome 5p by radiation hybrid mapping 
(Boehnke et al., 1991, Am J Hum Genet 49:1 174; Walter et al., 1994, Nature Genet 7:22) 
using the medium resolution Stanford G3 panel of 83 RH clones of the whole human genome 
(created at the Stanford Human Genome Center). A human lymphoblastoid cell line (donor; 
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rM) was exposed to 10,000 rad of x-rays and was then fused with nonirradiated hamster 
recipient cells (A3). Eighty-three independent somatic cell hybrid clones were isolated, and 
each represents a fusion event between an irradiated donor cell and a recipient hamster cell. 
The panel of G3 DNA was used for ordering markers in the region of interest as well as 

5 establishing the distance between these markers. 

The primers used for the RH mapping were TCP1 . 12 and UTR2 with 
amplification conditions of 94°C 45 sec, 55°C 45 sec, 72°C 45 sec, for 45 cycles using 
Boehringer Mannheim Taq buffer and Perkin-Elmer Taq. The 83 pools were amplified 
independently and 14 (17%) scored positive for hTRT (by appearance of a 346 bp band). The 

1 0 amplification results were submitted to Stanford RH server, which then provided the map 
location, 5p, and the closest marker, STS D5S678. 

By querying the Genethon genome mapping web site, the map location 
identified a YAC that contains the STS marker D5S678: CEPH YAC 780_C_3 Size: 
390,660 kb. This YAC also contained chromosome 17 markers. This result indicated that 

15 the hTRT gene is on chromosome 5, near the telomeric end. There are increased copy 
numbers of 5p in a number of tumors. Cri-du-chat syndrome also has been mapped to 
deletions in this region. 

EXAMPLE 6 

20 DESIGN AND CONSTRUCTION OF VECTORS FO R EXPRESSION OF hTRT 

PROTEINS AND POLYNUCLEOTIDES 
Expression of hTRT in Bacteria 

The following portion of this example details the design of hTRT-expressing 
bacterial and eukaryotic cell expression vectors to produce large quantities of full-length, 

25 biologically active hTRT. Generation of biologically active hTRT protein in this manner is 
useful for telomerase reconstitution assays, assaying for telomerase activity modulators, 
analysis of the activity of newly isolated species of hTRT, identifying and isolating 
compounds which specifically associate with hTRT, analysis of the activity of an hTRT 
variant protein that has been site-specifically mutated, and as an immunogen, as a few 

30 examples. 
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pThioHis A/hTRT Bacterial Expression Vector 

To produce large quantities of full-length hTRT, the bacterial expression 
vector pThioHis A (Invitrogen, San Diego, CA) was selected as an expression system. The 
hTRT-coding insert includes nucleotides 707 to 4776 of the hTRT insert in the plasmid 
pGRN121. This nucleotide sequence includes the complete coding sequence for the hTRT 
protein. 

This expression vector of the invention is designed for inducible expression in 
bacteria. The vector can be induced to express, in E. coli, high levels of a fusion protein 
composed of a cleavable, HIS tagged thioredoxin moiety and the full length hTRT protein. 
The use of the expression system was in substantial accordance with the manufacturer's 
instructions. The amino acid sequence of the fusion protein encoded by the resulting vector 
of the invention is shown below; (-*-) denotes an enterokinase cleavage site: 
MSDKIIHLTDDSFDTDVLKADGAILVDFWAHWCGPCKMIAPILDEIADEYQGKLTVAKLRID 

HNPGTAPKYGIRGIPTLLLFKNGEVAATKVGALSKGQLKEFLDANLAGSGSGDDDDK- * -VP 

MHELEIFEFAAASTQRCVLLRTWEALAPATPAMPRAPRCRAVRSLLRSHYREVLPLATFVRR 

LGPQGWRLVQRGDPAAFRALVAQCLVCVPWDARPPPAAPSFRQVSCLKELVARVLQRLCERG 

AKNVIJ^GFALLDGARGGPPEAFTTSWSYLPlSrTVTDALRGSGAWGLLLRRVGDDVljVHLIiA 

RCALFVLVAPSCAYQVCGPPLYQLGAATQARPPPHASGPRRRLGCERAWNHSVREAGVPLGL 

PAPGARRRGGSASRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCWSPARP 

AEEATSLEGALSGTRHSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHFLYSSGDKEQ 

LRPSFLLSSLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPLFLELLGNHAQ 

CP YGVLLKTHC PLRAAVTPAAGVCAREKPQGS VAAPEEEDTDPRRLVQLLRQHS S PWQVYGF 

VRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLIRRSPGV 

GCVPAAEHRLREEILAKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIG 

IRQHLKRVQLRELSEAEVRQHREARPALLTSRLRF I PKPDGLRP I VNMDYWGART FRREKR 
AERLTSRVKALFSVTjNYERARRPGLLGASVLGLDDIHRAWRTFVTjRVRAQDPPPELYFVKVD 
VTGAYDTIPQDRLTEVIAS 1 1 KPQNTYCVRRYAWQKAAHGHVRKAFKSHVS TLTDLQ PYMR 
QFVAHLQETSPLRDAWIEQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIPQGSI 
LSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVPEYGCW 
NLRKTWNFPVEDEALGGTAFVQMPAHGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASLTF 
NRGFKAGRNMRRKLFGVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAYRFHACVLQLPFH 



229 



QQVWKNPTFFLRVISDTASLCYSILKAKNAGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTR 
HRVTYVPLLGSLRTAQTQLSRKLPGTTLTALEAAANPALPSDFKTILD 

pGEX-2TK with hTRT Nucleotides 3272 to 4177 of pGRN121 

5 This construct of the invention is used to produce fusion protein for, e.g., the 

purpose of raising polyclonal and monoclonal antibodies to hTRT protein. Fragments of 
hTRT can also be used for other purposes, such as to modulate telomerase activity, for 
example, as a dominant-negative mutant or to prevent the association of a telomerase 
component with other proteins or nucleic acids. 

10 To produce large quantities of an hTRT protein fragment, the E. coli 

expression vector pGEX-2TK (Pharmacia Biotech, Piscataway N.J) was selected, and used 
essentially according to manufacturer's instructions to make an expression vector of the 
invention. The resulting construct contains an insert derived from nucleotides 3272 to 4177 
of the hTRT insert in the plasmid pGRN121. The vector directs expression in E. coli of high 

1 5 levels of a fusion protein composed of glutathione-S-transferase sequence (underlined below), 
thrombin cleavage sequence (double underlined), recognition sequence for heart muscle 
protein kinase (italicized), residues introduced by cloning in brackets ([GSVTK]) and hTRT 
protein fragment (in bold) as shown below: 

|V[ff P T T ■fTyWK' T TCGTjVOPTP T iI/LE YT ■FFTTYRHHT lYF.RDEGDKWRNKKFFiT iGT iF,F PNIjP YY I PGP 

20 vyT/rOSMAITPVTADTmNMT.nttCPKKR^^^ 

TCT .PP.MT iTCMFEDRT .rHTCTYLNGDHVTHPDFMT .YDALDWT .VMDPMCLD AFPKT .VCFKKRIEAI 
POTDTCYLKS.qKYTAWPLOGWnATFGGGD RPPK-SDLVPRGSigigASVrGSVTKl IPQGSILSTL 
LCSLCYGDMENKLFAGIRRDGLLLRLV13DFLLVTPHLTHAKTFLRTLVRGVPEYGCVVNLRK 
TWNFPVEDEALGGTAFVQMPAHGLFPWCGLLLDTRTLEVQSDYSSYARTSIRAS'WTFNRGF 

25 KAGRNMRRKLFGVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVW 
KNPTFFLRVISDTASLCYSILKAKNAGMSLGAKGAAGPLPSEAVQVJLCHQAFLLKI.TRHRVT 
YVPLLGSLRTAQTQLSRKLPGTTLTALEAAANPALPSDFKTILD 

When this fusion protein was expressed, it formed insoluble aggregates. It 
30 was treated generally as described above, in the section entitled purification of proteins from 
inclusion bodies. Specifically, induced cells were suspended in PBS (20 mM sodium 
phosphate, pH 7.4, 150 mM NaCl) and disrupted by sonication. NP-40 was added to 0.1%, 
and the mixture was incubated for 30 minutes at 4°C with gentle mixing. The insoluble 
material was collected by centrifugation at 25,000g for 30 minutes at 4°C. The insoluble 
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material was washed once in 4M urea in PBS, collected by centrifugation, then washed again 
in PBS. The collected pellet was estimated to contain greater than 75% fusion protein. This 
material was dried in a speed vacuum, then suspended in adjuvant for injection into mice and 
rabbits for the generation of antibodies. Separation of the recombinant protein from the 
5 glutathione S-transferase moiety is accomplished by site-specific proteolysis using thrombin 
according to manufacturer's instructions. 



p GEX-2TK with h TRT Nucleotides 2426 to 3274 ofpGRN121 with HISS Tag 

To produce large quantities of a fragment of hTRT, another E. coli expression 

1 0 vector pGEX-2TK construct was prepared. This construct contains an insert derived from 
nucleotides 2426 to 3274 of the hTRT insert in the plasmid pGRN121 and a sequence 
encoding eight consecutive histidine residues (HIS-8 Tag). To insert the fflS-8 TAG, the 
pGEX-2TK vector with hTRT nucleotides 2426 to 3274 of pGRN121 was linearized with 
BamHl . This opened the plasmid at the junction between the GST-thrombin-heart muscle 

1 5 protein kinase and the hTRT coding sequence. A double stranded oligonucleotide with 
BamHl compatible ends was ligated to the linearized plasmid resulting in the in-fraine 
introduction of eight histidine residues upstream of the hTRT sequence. 

The vector directs expression in E coli of high levels of a fusion protein 
composed of glutathione-S-transferase sequence (underlined); thrombin cleavage sequence 

20 (double underlined); recognition sequence for heart muscle protein kinase (italicized); a set of 
three and a set of five residues introduced by cloning are in brackets ([GSV] and [GSVTK]); 
eight consecutive histidines (also double underlined); and hTRT protein fragment (in bold): 
MSPTTnvWTKGT.VOPTRTJJ.EYLFF.KYKFmYER nKGnKWRNKKFFJ,GT,RFPNLPY 
VTOGTWKT TOSMAITR YT ADKHNMT .GGCPKERA F.TSMT .FGAVT ,DTR YGVSRTAYSKDF 

25 F.TT .KVDFLSKT PF.MT .KMFED P T .CHKTYT .NGDFTVTHPDFMT , YD A T ,D VVT , YMQPMCL 
D A FPKT VCFKKRIF AIPOIDKYI ,K SSKYTAWPT ,OGWO ATF GGGDHPPK SDLVPRGSi?i? 
^5nGSV]HJfflHJ^HH[GSVTK]MSvYWELLRSFFYvTETTFQKNRLFFYRPSVWS 

YWGARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGASVLGLDDIHRAW 
RTFVLRVRA.QDPPPELYFVKVDVTGAYDTIPQDRLTEVIASIIKPQNTYCVRRYA 
WQKAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAWIEQSSSL 

NEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGI 

Each of the pGEX-2TK vectors of the invention can be used to produce fusion 
3 5 protein for the purpose of raising polyclonal and monoclonal antibodies to hTRT protein. 
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Additionally, this fusion protein can be used to affinity purify antibodies raised to hTRT 
peptides that are encompassed within the fusion protein. Separation of the recombinant 
protein from the glutathione S-transferase moiety can be accomplished by site-specific 
proteolysis using thrombin according to manufacturer's instructions. 

5 

pGEX-2TK with hTRT Nucleotides 2426 to 3274 ofpGRN121, no. HIS-8 Tag 

To produce large quantities of a fragment of hTRT, another E. coli expression 
vector pGEX-2TK construct was prepared. 

This construct contains an insert derived from nucleotides 2426 to 3274 of the 
10 hTRT insert in the plasmid pGRN121, but without the HIS-8 tag of the construct described 
above. The vector directs expression in E coli of high levels of a fusion protein composed of 
glutathione-S-transferase (underlined), thrombin cleavage sequence (double underlined), 
recognition sequence for heart muscle protein kinase (italicized), residues introduced by 
cloning in brackets ([GSVTK]) and hTRT protein fragment (in bold): 

15 M.qPTT^YWKT^T.vnPTRLTJ.KVT.F.RKYE F .HT.VF.RnF.GDKWRNKKFELGT.F.FPNT.PYYIpgD 
WT .TOSMft T TP. YI ADTmTJMT ,nr,C!PK F R RF T SMT .F.GAVT ,DT RYGVSR T AYSKDFF,TLKVPFLS 
TfT.PFMT.TmFFnRLCHyTYT.NOnTWT TTPnFMT.YDM.DVVT.YMDPMrT.PfiFPKTiVCFKKRIEAX 

POTnTCYT.KSSTfVT&wPT,nnwnATFnr,r,DHP Py.qDT J vPRGSRRAgy[GSVTK] msvywellr 
SFFYVTETTFQKNRLFFYRPSVWSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSR 
20 LRFIPKPDGLRPIVNMDYWGARTFRREKRAERLTSRKALFSVLNYERARRPGLLGASVLGL 
DDIHRAWRTFVXRVRAQDPPPEYFVKVDVTGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAV 
VQKAAHGVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAWIEQSSSLNEASGLFDVFL 

RFMCHHAVRIRGKSYVQCQGI 

25 pGEX-2TK with hTRT Nucleotides 1625 to 2458 ofpGRN121 

To produce large quantities of a fragment of hTRT protein, another E. coli 
expression vector pGEX-2TK construct was prepared. 

This construct contains an insert derived from nucleotides 1625 to 2458 of the 
hTRT insert in the plasmid pGRN121. The vector directs expression in E coli of high levels 
30 of a fusion protein composed of glutathione-S-transferase, (underlined), thrombin cleavage 
sequence (double underlined), recognition sequence for heart muscle protein kinase 
(italicized) residues introduced by cloning in brackets ([GSVTK]) and hTRT protein 
fragment (in bold): 

MSPTT.GYWKTKGLVnPTRTJJ.EYT.F-FKYKRHLY F.T?r)RnDKWRNKKFKT,Gr,EFPNLPY 
35 VTDGDVKLTOSMATTRYTADKHNMT.GGCPK F.R AFISMLEG A VT DTR YGVSRIAYSKDF 
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ETLKVDFLSKLPEMLKMFEDRLCHKTYLNGDHVTHPDFMLYDAT.DVVLYMDPMCL 
DAFPKLVCFKKRIEAIPOIDKYLKSSKYIAWPLOGWOATFGGGDHPPKSDLVPRGSJ gi? 
^5F[GSVTK]ATSLEGALSGTRHSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAE 
TKHFLYSSGDKEQLRPSFLLSSLRPSLTGARRL\^TIFLGSRPWMPGTPR]EILPRL 
5 PQRYWQMRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGS 
VAAPEEEDTDPRRLVQLLRQHSSPWQWGFVRACLRRLVPPGLWGSRHNERRF 
LRNTKETISLGKHAKLSLQELTWKMSVRDCAWLRRSPGVGCVPAAEHB1REEI 
LAKFLHWLMSVYWELLRS 



10 pGEX-2TK with hTRT Nucleotides 782 to 1636 ofpGRN121 

To produce large quantities of a fragment of hTRT protein, another E. colt 
expression vector pGEX-2TK construct was prepared. 

This construct contains an insert derived from nucleotides 782 to 1636 of the 
hTRT insert in the plasmid pGRN121 . The vector directs expression in E coli of high levels 
15 of a fusion protein composed of glutathione-S-transferase, (underlined), thrombin cleavage 
sequence (double underlined), recognition sequence for heart muscle protein kinase 
(italicized) residues introduced by cloning in brackets ([GSVTK]) and hTRT protein 
fragment (in bold): 

M$ P I LgYWK J K(gLVQ PTRLLLE YLgEKYEEHIf Y ERDE(?D KWRNKKFELQIjE; F PNL P YY J PgD 
20 VKLTQSMAIIRYIADKHNMLGGCPKERAEISMLEGAVLDIRYGVSRIAYSKDFETLKVDFLS 
KLPEMLKMFEDRLCHKTYLNGDHVTHPDFMLYDALDVVLYMDPMCLDAFPKLVCFKKRIEAI 
POIDKYLKSSKYIAWPLOGWOATFGGGDHPPKSDLVPRG5R J?AgVrGSVTKl MPRAPRCRAV 
RSLLSHYRETVXPIATFVRRLGPQG^ 

SCLKELVARVIiQRLCERGAKNVLAFGFALLDGARGGPPEATTSVRSYLPNTVTDALRGSGAW 
25 GLLLRRVGDDVLVHLLARCALFVLVAPCAYQVCGPPLYQLGAATQARPPPHASGPRRRLGCE 
RAWNHSVREAGVPLGLPAPGARRRGGSASRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRT 
RGPSDRGFCWSPARPAEEATSL 

pT7FLhTRT with hTRTcDNA Lacking 5 '-Non-Coding Sequence 
30 As described above, in one embodiment, the invention provides for gin hTRT 

that is modified in a site-specific manner to facilitate cloning into bacterial, mamnu dian, yeast 
and insect expression vectors without any 5* untranslated hTRT sequence. In some 
circumstances, minimizing the amount of non-protein encoding sequence allows for 
improved protein production (yield) and increased mRN A stability. In this embodiment of 
35 the invention, the hTRT gene's 5 1 non-coding region was removed before cloning into a 
bacterial expression vector. 

This was effected by engineering an additional restriction endonuclease site 
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just upstream (5') to the start (ATG) codon of the hTRT coding sequence (Figure 16). The 
creation of a restriction site just 5 ! to the coding region of the protein allows for efficient 
production of a wide variety of vectors that encode fusion proteins, such as fusion proteins 
comprising labels and peptide TAGs, for immunodetection and purification. 
5 Specifically, the oligonucleotide 

5'- CCGGCCACCCCCCATATGCCGCGCGCTCCC-3 , was used as described above to 
modify hTRT cDNA nucleotides 779 to 78 1 of the hTRT cDNA (Figure 16) from GCG to 
CAT. These 3 nucleotides are the last nucleotides before the ATG start codon so they do not 
modify the protein sequence. The change in sequence results in the creation of a unique Ndel 

10 restriction site in the hTRT cDNA. Single-stranded hTRT DNA was used as a DNA source 
for the site directed mutagenesis. The resulting plasmid was sequenced to confirm the 
success of the mutagenesis. 

This modification allowed the construction of the following plasmid of the 
invention, designated pT7FLhTRT. The site-specifically modified hTRT sequence (addition 

1 5 of the Ndel restriction site) was digested with Ndel and NotI (and filled in with Klenow 

enzyme to generate blunt ended DNA) to generate an hTRT encoding nucleic acid fragment 
The fragment was then cloned into a pSL341 8 plasmid previously restriction digested with 
Ndel and Smal (also a blunt ended cutter). pSL 3418 is a modified pAED4 plasmid into 
which a FLAG sequence (Immunex Corp, Seattle WA) and an enterokinase sequence are 

20 inserted just upstream from the above-referenced Ndel site. This plasmid, designated 

pT7FLhTR, allows the expression of full length hTRT (with a Flag-Tag at its 5 ! end) in an 
E.coli strain expressing the T7 RNA polymerase. 

Plasmids with hTRT cDNA Lacking 3*-Non-Coding Sequence 

25 As discussed above, the invention provides for expression vectors containing 

TRT-encoding nucleic acids in which some or all non-coding sequences have been deleted. 
In some circumstances, minimizing the amount of non-protein encoding sequence allows for 
improved protein production (yield) and increases mRNA stability. In this embodiment of 
the invention, the 3' untranslated region of hTRT is deleted before cloning into a bacterial 

30 expression plasmid. 

The plasmid pGRN121, containing the full length hTRT cDNA, as discussed 
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above, was first deleted of all Apal sites. This was followed by deletion of the MscL-HincII 
hTRT restriction digest enzyme fragment containing the 3'UTR. The Ncol-Xbal restriction 
digest fragment containing the stop codon of hTRT was then inserted into the Ncol-Xbal site 
of pGRN121 to make a plasmid equivalent to pGRN121, designated pGRN124, except 
lacking the 3'UTR. 

Bacterial Expression Vectors Using Antibiotic Selection Markers 

The invention also provides for bacterial expression vectors that can contain 
selection markers to confer a selectable phenotype on transformed cells and sequences coding 
for episomal maintenance and replication such that integration into the host genome is not 
required. For example, the marker may encode antibiotic resistance, particularly resistance to 
chloramphenicol (see Harrod (1997) Nucleic Acids Res. 25: 1720-1726), kanamycin, G418, 
bleomycin and hygromycin, to permit selection of those cells transformed with the desired 
DNA sequences, see for example, Blondelet-Rouault (1997) Gene 190:315-317; and Mahan 
(1995) Proc Natl Acad Sci USA 92:669-673. 

In one embodiment of the invention, the full length hTRT was cloned into a 
modified BlueScript plasmid vector (Stratagene, San Diego, CA), designated pBBS235, into 
which a chloramphenicol antibiotic resistence gene had been inserted. The NotI fragment 
from pGRN124 (discussed above) containing the hTRT ORF into the NotI site of pBBS235 
so that the TRT ORF is in the opposite orientation of the vector's Lac promoter. This makes a 
plasmid that is suitable for mutageneis of plasmid inserts, such as TRT nucleic acids of the 
invention. This plasmid construct, designated pGRN125, can be used in the methods of the 
invention involving mutagenesis of telomerase enzyme and TRT protein coding sequences 
and for in vitro transcription of hTRT using the T7 promoter (and in vitro transcription of 
antisense hTRT using the T3 promoter). 

In another embodiment of the invention, NotI restriction digest fragments from 
pGRN124 containing the hTRT ORF were subcloned into the NotI site of pBBS235 
(described above) so the TRT ORF is in the same orientation as the vector's Lac promoter. 
This makes a plasmid, designated pGRN126, that can be used for expression of full length 
hTRT in E. colL The expressed product will contain 29 amino acids encoded by the vector 
pBBS235, followed by 18 amino acids encoded by the 5'UTR of hTRT, followed by the full 
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length hTRT protein. 

In a further embodiment of the invention, in vitro mutagenesis of pGRN125 
was done to convert the hTRT initiating ATG codon into a Kozak consensus and create 
EcoRI and Bglll restriction digest sites to facilitate cloning into expression vectors. The 
oligonucleotide 

S'-TGCGCACGTGGGAAGCCCTGGCagatctgAattCcaCcATGCCGCGCGCTCCCCGCTG- 
y (altered nucleotides in lower case) was used in the mutagenesis procedure. The resulting 
expression vector was designated pGRN127. 

In another embodiment of the invention, the second Asp of the TRT "DD 
motif was converted to an alanine to create a non-functional telomerse enzyme, thus creating 
a mutant TRT protein for use as a dominant/negative mutant. The hTRT coding sequence 
was mutagenized in vitro using the oligonucleotide 5 ! - 

CGGGACGGGCTGCTCCTGCGTTTGGTGGAcGcgTTCTTGTTGGTGACACCTCACCT 
CACC-3' to convert the asparagine codon for residue 869 (Asp869) to an alanine (Ala) 
codon. This also created an Mlul restriction enzyme site. The resulting expression plasmid 
was designated pGRN130, which also contains the Kozak consensus sequence as described 
forpGRN127. 

The invention also provides a vector designed to express an antisense 
sequence fragment of hTRT. The pGRN126 plasmid was cut to completion with MscI and 
Smal restriction enzymes and religated to delete over 95% of the hTRT ORF. One 
Smal-MscI fragment was re-inserted during the process to recreate CAT activity. This 
unpurified plasmid was then redigested with Sail and EcoRI and the fragment containing the 
initiating codon of the hTRT ORF was inserted into the Sall-EcoRI sites of pBBS212 to 
make an antisense expression plasmid expressing the antisense sequence spanning the 5'UTR 
and 73 bases pair residues of the hTRT ORF (in mammalian cells). This plasmid was 
designated pGRN135. 
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Expression of hTRT Telomerase in Yeast 

The present invention also provides hTRT-expressing yeast expression vectors 
to produce large quantities of full-length, biologically active hTRT. 

Pichia pastoris Expression Vector pPICZ B and Full Length hTRT 

To produce large quantities of full-length, biologically active hTRT , the Picha 
pastoris expression vector pPICZ B (Invitrogen, San Diego, CA) was selected. The hTRT- 
coding sequence insert was derived from nucleotides 659 to 4801 of the hTRT insert in 
plasmid pGRN121. This nucleotide sequence includes the full-length sequence encoding 
hTRT. This expression vector is designed for inducible expression in P. pastoris of high 
levels of full-length, unmodified hTRT protein. Expression is driven by a yeast promoter, 
but the expressed sequence utilizes the hTRT initiation and termination codons. No 
exogenous codons were introduced by the cloning. The resulting pPICZ B/hTRT vector was 
used to transform the yeast. 

Pirhia pastoris Expression Vector hTRT-His6/pPICZ B 

A second Picha pastoris expression vector of the invention derived from 
pPICZ B, also contains the full-length sequence encoding hTRT derived from nucleotides 
659 to 4801 of the hTRT insert in the plasmid pGRN121. This hTRT-His6/pPICZ B 
expression vector encodes full length hTRT protein fused at its C-terminus to the Myc 
epitope and His6 reporter tag sequences. The hTRT stop codon has been removed and 
replaced by vector sequences encoding the Myc epitope and the His6 reporter tag as well as a 
stop codon. This vector is designed to direct high-level inducible expression in yeast of the 
following fusion protein, which consists of hTRT sequence (underlined), vector sequences in 
brackets ([L] and [NSAVD]) the Myc epitope (double underlined), and the His6 tag 
(italicized): 

MPR APRCR A VR ST ,T JRSHYR EVT ,PLATFVR RT .GPOGW RT .VORGDPA AFR ALV AQCLV 
rVPWnARPPPAAPSFROVSCT.KELVARVT-ORT.CERG AKNVT AFnFAT.T.DGARGGPP 

KAFTTSVRSYT.PNTVTDAT .ROSOA WGT J 1 RR VGDDVT .VWT.T.ARP.AT .FVT ,VAPSCAY 
OVnGPPLYOT GA ATOARPPPH ASGPRRR T .GCER A WNHS VR E A GVPT ,GT ,P APGARRR 
GGSASRSLPTPKRPRRGAAPEPERTPVGOGSWAHPG PTRr.PRnRrTFrVVSPARPAEE 

ATST J EGALSGTRHSHPSVGROHHAGPPSTSRPPRPW r>TPrpPVV AFTKFfFT .YSSGDK 
F.nT.RPSFLT.SST,RPSLTGARRT.VETTFr.GSRPWMPG TPRRT.PRTPORYWOMRPLFLEL 
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LGNHAOCPYGVLLKTHCPLRAAVTPAAGVCAREKPOGSVAAPEEEDTDPRRLVOLL 

ROHSSPWOVYGFVRACLRRLVPPGLWGSRHNERRFLRNTKKFTSr.GKHAKLSLOETT 

WKMSVRDCAWLRRSPGVGCVPAAEHRLRE^ 

FTTFOKNRLFFYRKSVWSKLOSIGIROHLKRVOLRELSEAEVROHREARPALLTSRLR 
5 FTPKPDGLRPIVNMDWVGARTFRREKRAERLTSRVKALFSVLNYERARRPGTJ.GAS 
VLGLDDIHRAWRTFVLRVRAODPPPELYFVKVDVTGAYDTIPODRLTEVIASIIKPON 
TYCVRRYAWOKAAHGHVRKAFKSHVSTLTDLOPYMROFVAHLOETSPLRDAVVIE 
QSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVOCOGIPOGSILSTLLCSLCYGDMEN 
KT.FAGTRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVPEYGCVVNLRKWVNFP 
10 VEDEALGGTAFVOMPAHGLFPWCGLLLDTRTLEVOSDYSSYARTSIRASLTR<rRGFK 
A GRNMRRKLFGVLRLKCHSLFLDLO VNSLO WCTM YXILLLO A YRFH A C VT ,OT ,PFH 
QOVWKNPTFFLRVTSDTASLCYSILKAKNAGMSLGAKGAAGPLPSEAVOWT.CHOAF 
TJXTTRHRVTYVPLI^tSLRTAOTOLSRKLPGTTLTALEAAANPALPSDFKTILD |I,]^ 
KLISEEDL rNSAVDWiHffg/i/ 

15 

Expression of hTRT in Insect Cells 

The present invention also provides hTRT telomerase-expressing insect cell 
expression vectors that produce large quantities of full-length, biologically active hTRT. 



20 Baculovirus Expression Vector pVL1393 and Full Length hTRT 

The telomerase coding sequence of interest was cloned into the baculovirus 
expression vector pVL1393 (Invitrogen, San Diego, CA). This construct was subsequently 
cotransfected into Spodoptera fungupeida (sf-9) cells with linearized DNA from Autograph 
California nuclear polyhedrosis virus (Baculogold-AcMNPV). The recombinant 
25 baculoviruses obtained were subsequently plaque purified and expanded following standard 
protocols. 

This expression vector provides for expression in insect cells of high levels of 
full-length hTRT protein. Expression is driven by a baculoviral polyhedrin gene promoter. 
No exogenous codons were introduced by the cloning. 

30 Baculovirus Expression Vector pBlueBacHis2 B and Full Length hTRT 

To produce large quantities of full-length, biologically active hTRT, the 
baculovirus expression vector pBlueBacHis2 B (Invitrogen, San Diego, CA) was selected as a 
source of control elements. The hTRT-coding insert consisted of nucleotides 707 to 4776 of 
the hTRT insert in plasmid pGRN121 . 

35 A full length hTRT with a His6 and Anti-Xpress tags (Invitrogen) was also 

constructed. This vector also contains an insert consisting of nucleotides 707 to 4776 of the 
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hTRT insert from the plasmid pGRN121. The vector directs expression in insect cells of high 
levels of full length hTRT protein fused to a cleavable 6-histidine and Anti-Xpress tags, and 
the amino acid sequence of the fusion protein is shown below; (-*-) denotes enterokinase 
cleavage site: 

5 MPRGSPIHHHrIHGMASMTGGQQMGRDLYDDDDL-*-DPSSRSAAGTMEFAAA 
STQRCVLLRTWEALAPATPAMPRAPRCRAVRSLLRSHYREVLPLATFVRRLGPQGW 
RLVQRGDPAAFRALVAQCLVCVPWDARPPPAAPSFRQVSCLKELVARVLQRLCERG 
AKNVLAFGFALLDGARGGPPEAFTTSVRSYLPNTVTDALRGSGAWGLLLRRVGDDV 
LVHLLARCALFVLVAPSCAYQVCGPPLYQLGAATQARPPPHASGPRRRLGCERAWN 

10 HSVREAGVPLGLPAPGARRRGGSASRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGR 
TRGPSDRGFCWSPARPAEEATSLEGALSGTRHSHPSVGRQHHAGPPSTSRPPRPWDT 
PCPPVYAETKHFLYSSGDKEQLRPSFLLSSLRPSLTGARRLVETIFLGSRPWMPGTPRR 
LPRLPQRYWQMRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGS 
VAAPEEEDTDPRRLVQLLRQHSSPWQVYGFVRACLRRLVPPGLWGSRHNERRFLRN 

1 5 TKKFISLGKHAKLSLQELTWKMSVRDC AWLRRSPGVGCVPAAEHRLREEIL AKFLH 
WLMSVYWELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKRVQLRELSE 
AEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKRAERLTSRVKA 
LFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRVRAQDPPPELYFVKVIDVTGA 
YDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQKAAHGHVRKAFKSHVSTLTELQPYM 

20 RQFVAHLQETSPLRDAWIEQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGI 
PQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVR 
GVPEYGCVVNLRKTVVNFPVEDEALGGTAFVQMPAHGLFPWCGLLLDTRTLEVQSD 
YSSYARTSIRASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLDLQVNSLQWCTNIY 
KILLLQAYRFHACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKNAGMSLGAK 

25 GAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLPGTTLTALE 
AAANPALPSDFKTILD 

Baculovirus Expression Vector pBlueBac4.5 and Full Length hTRT Protein 

To produce large quantities of full-length, biologically active hTRT, a second 
30 baculovirus expression vector, pBlueBac4.5 (Invitrogen, San Diego, CA) was constructed. 
The hTRT-coding insert also consisted of nucleotides 707 to 4776 of the hTRT from the 
plasmid pGRN121. 
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Baculovirus Expression Vector pMelBacB and Full Length hTRT Protein 

To produce large quantities of full-length, biologically active hTRT, a third 
baculovirus expression vector, pMelBacB (Invitrogen, San Diego, CA) was constructed. The 
hTRT-coding insert also consists of nucleotides 707 to 4776 of the hTRT insert from the 
5 plasmidpGRN121. 

pMelBacB directs expression of full length hTRT in insect cells to the 
extracellular medium through the secretory pathway using the melittin signal sequence. High 
levels of full length hTRT are thus secreted. The melittin signal sequence is cleaved upon 
excretion, but is part of the protein pool that remains intracellularly. For that reason, it is 
1 0 indicated in parentheses in the following sequence. The sequence of the fusion protein 

encoded by the vector is shown below: 

(MKFLVNVALVFMVVYISYIYA)-*-DPSSRSAAGTMEFAAASTQRCVLLRTWE 
ALAPATPAMPRAPRCRAVRSLLRSHYREVLPLATFVRRLGPQGWRLVQRGDPAAFR 
ALVAQCLVCVPWDARPPPAAPSFRQVSCLKELVARVLQRLCERGAKNVLAFGFALL 

1 5 DGARGGPPEAFTTSVRSYLPNTVTDALRGSGAWGLLLRRVGDDVLVHLLARCALFV 
LVAPSCAYQVCGPPLYQLGAATQARPPPHASGPRRRLGCERAWNHSVREAGVPLGL 
PAPGARRRGGSASRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCVV 
SPARPAEEATSLEGALSGTRHSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHF 
LYSSGDKEQLRPSFLLSSLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQM 

20 RPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPR 
RLVQLLRQHSSPWQVYGFVRACLRRLWPGLWGSRHNERRFLRNTKKFISLGKHAK 
LSLQELTWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWLMSVYVVELL 
RSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPA 
LLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKRAERLTSRVKALFSVLNYERARRP 

25 GLLGASVLGLDDIHRAWRTFVLRVRAQDPPPELYFVKVDVTGAYDTIPQDRLTEVIA 
SIIKPQNTYCVRRYAVVQKAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPL 
RDAVVIEQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIPQGSILSTLLCSLC 
YGDMENKLFAGIRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVPEYGCVVNLR 
KTVVNFPVEDEALGGTAFVQMPAHGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASL 

30 TFT4RGFKAGRNMRRKLFGVLRLKCHSLFLDLQVNSLQWCTMYKILLLQAYRFHAC 
VLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKNAGMSLGAKGAAGPLPSEAVQ 
WLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLPGTTLTALEAAANPALPSDFK 

TILD 



35 

Expression of hTRT in Mammalian Cells 

The present invention also provides vectors to produce hTRT in large 
quantities as full-length, biologically active protein in a variety of mammalian cell lines, 
which is useful in many embodiments of the invention, as discussed above. 
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MPSV-hTRT Expression Plasmids 

The invention also provides for an expression system for use in mammalian 
cells that gives the highest possible expression of recombinant protein, such as telomerase, 
5 without actually modifying the coding sequence (e.g. optimizing codon usage). In one 

embodiment, the invention provides MPSV mammalian expression plasmids (from plasmid 
pBBS212, described as pMPSV-TM in Lin J-H (1994) Gene 47:287-292) capable of 
expressing the TRTs of the invention. The MPSV plasmids can be expressed either as stable 
or transient clones. 

10 In this expression system, while the hTRT coding sequence itself is 

unchanged, exogenous transcriptional control elements are incorporated into the vector. The 
myeloproliferative sarcoma virus (MPSV) LTR (MPSV-LTR) promoter, enhanced by the 
cytomegalovirus (CMV) enhancer, is incorporated for transcriptional initiation. This 
promoter consistently shows higher expression levels in cell lines (see Lin J-H (1994) supra). 

15 A Kozak consensus sequence can be incorporated for translation initiation (see Kozak (1 996) 
Mamm. Genome 7:563-574). All extraneous 5 1 and 3 ! untranslated hTRT sequences can be 
removed to insure that these sequences do not interfere with expression, as discussed above. 
The MPSV plasmid containing the complete hTRT coding sequence, with all extraneous 
sequences included, is designated pGRN133. A control, hTRT "antisense" plasmid was also 

20 constructed. This vector is identical to pGRN133 except that the TRT insert is the antisense 
sequence of hTRT (the antisense, which control can be used as a vector is designated 
pGRN134). The MPSV plasmid containing the complete hTRT coding sequence with all 
other extraneous sequences removed and containing the Kozak consensus sequence is 
designated pGRN145. 

25 Two selection markers, PAC (Puromycin-N-acetyl-transferase = Puromycin 

resistance) and HygB (Hygromycin B = Hygromycin resistance) are present for selection of 
the plasmids after transfection (see discussion referring to selectable markers, above). 
Double selection using markers on both sides of the vector polylinker should increase the 
stability of the hTRT coding sequence. A DHFR (dihydrofolate reductase) encoding 

30 sequence is included to allow amplification of the expression cassette after stable clones are 
made. Other means of gene amplification can also be used to increase recombinant protein 
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yields. 

The invention also provides for MPSV mammalian expression plasmids 
containing hTRT fusion proteins. In one embodiment, the hTRT sequence, while retaining its 
5* untranslated region, is linked to an epitope flag, such as the IBI FLAG (International 
5 Biotechnologies Inc. (IBI), Kodak, New Haven, CT) and inserted into the MPSV expression 
plasmid (designated pGRN147). This particular constuct contains a Kozak translation 
initiation site. The expressed fusion protein can be purified using the M-l anti-FLAG 
octapeptide monoclonal antibody (IBI, Kodak, supra). 

In another embodiment, hTRT is site-specifically altered. One amino acid 

10 residue codon is mutagenized, changing the aspartic acid at position 869 to an alanine. This 
Asp869->Ala hTRT mutant, retaining its 5 r untranslated region and incorporating a Kozak 
sequence, was inserted into an MPSV expression plasmid, and designated pGRN146. The 
Asp869->Ala hTRT mutant was further engineered to contain the FLAG sequence, as 
described above, and the insert cloned into an MPSV expression plasmid. One such 

1 5 expression plasmid is designated pGRNl 54-L Specifically, for pGRNl 54-1, an Eaml 1 051 
restriction digest fragment from pGRNl 46 containing the Kozak sequence-containing "front 
end" (5 f segment) of hTRT is cloned into the Eaml 1051 sites of pGRN147 (see above) to 
make an MPSV expression plasmid capable of expressing hTRT with a Kozak sequence, the 
above-described D869->A mutation, and the IBI flag. 

20 Another embodiment of the invention is an expression plasmid derived from 

pGRN146. The mammalian expression plasmid, designated pGRNl 52, was generated by 
excising the EcoRI fragment from plasmid pGRNl 46 (containing the hTRT ORF) and cloned 
into the EcoRI site of pBBS212 to remove the 5'UTR of hTRT. The hTRT is oriented so that 
its expression is controlled by the MPSV promoter. This makes a mammalian expression 

25 plasmid that expresses hTRT with a Kozak consensus sequence and the D869->A mutation, 
and uses the MPSV promoter. 

The invention provides for a mammalian expression vector in which hTRT is 
oriented so that the hTRT coding sequence is driven by the MPSV promoter. For example, 
an EcoRI restriction digest fragment from pGRNl 37 containing the hTRT open reading 

30 frame (ORF) was cloned into the EcoRI site of pBBS212 (see below), thus removing the 5 f 
untranslated region (5'-UTR) of hTRT. pGRNl 37 was constructed by excising a 
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SalI-Sse8387I fragment from pGRN130, described below, containing the Kozak mutation of 
hTRT into the Sal 1-SSE 83871 sites of pGRN136, making a mammalian expression plasmid 
expressing hTRT containing a Kozak consensus sequence off the MPSV promoter. Plasmid 
pGRN136 was constructed by excising a Hindlll Sail fragment from pGRN126 containing 
5 the hTRT ORF and cloning it into the Hindlll Sail sites of plasmid, pBBS242, making a 
mammalian expression plasmid expressing hTRT off the MPSV promoter). This makes a 
mammalian expression plasmid, designated pGRN145, that expresses hTRT with a Kozak 
consensus sequence using the MPSV promoter. See also the pGRN152 MPSV promoter- 
driven mammalian expression vector described below. 

10 

hTRT Expressed in 293 Cells using Episomal Vector pEBVHis 

An episomal vector, pEBVHis (Invitrogen, San Diego, CA) was engineered to 
express an hTRT fusion protein comprising hTRT fused to an N-terminal extension epitope 
tag, the Xpress epitope (Invitrogen, San Diego, CA) (designated pGRN122). The NotI hTRT 
1 5 fragment from pGRN 1 2 1 containing the hTRT ORF was cloned into the NotI site of 

pEBVHisA so that the hTRT ORF is in the same orientation as the vector's Rous Sarcoma 
Virus (RS V) promoter. In this orientation the His6 flag was relatively closer to the 

N-terminus of hTRT. 

A vector was also constructed containing as an insert the antisense sequence of 

20 hTRT and the epitope tag (the plasmid designated pGRN123, which can be used as a control). 
The vector was transfected into 293 cells and translated hTRT identified and isolated using an 
antibody specific for the Xpress epitope. pEBVHis is a hygromycin resistant EBV episomal 
vector that expresses the protein of interest fused to a N-terminal peptide. Cells carrying the 
vector are selected and expanded, then nuclear and cytoplasmic extracts prepared. These and 

25 control extracts are immunoprecipitated with anti-Xpress antibody, and the 

immunoprecipitated beads are tested for telomerase activity by conventional assay. 
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Expressionm of Recombinant hTRT in Mortal, Normal Diploid Human Cells 

In one embodiment of the invention, recombinant hTRT and necessary 
telomerase enzyme complex components can be expressed in normal, diploid mortal cells to 
increase their proliferative capacity or to immortalize them, or to facilitate immortalizing 
5 them. This allows one to obtain diploid immortal cells with an otherwise normal phenotype 
and karotype. As discussed above, this use of telomerase has enormous commercial utility. 

Sense hTRT (Figure 16) and antisense hTRT were cloned into a CMV vector. 
These vectors were purified and transiently transfected into two normal, mortal, diploid 
human cell clones. The human clones were young passage diploid human BJ and IMR90 cell 
10 strains. 

Analysis of telomerase activity using a TRAP assay (utilizing the TRAPeze™ 
Kit (Oncor, Inc., Gaithersburg, MD) showed that transfection of sense hTRT - but not 
antisense hTRT - generated telomerase activity in both the BJ and IMR90 cell strains. 



1 5 Expression of Recombinant hTRT in Immoralized IMR90 Human Cells 

Using the same hTRT sense construct cloned into CMV vectors used in the above 
described diploid human BJ and IMR90 cell strains studies, immortalized SW13 ALT 
pathway cell line (an IMR90 cell immortalized with SV40 antigen) was transiently 
transfected. A TRAP assay (TRAPeze, Oncor, Inc, Gaithersburg, MD) demonstrated that 
20 telomerase activity was generated in the sense construct transfected cells. 

Vectors for Regulated Expression of hTRT in Mammalian Cells: Inducible and 
Repressible Expression of hTRT 

The invention provides vectors that can be manipulated to induce or repress 
25 the expression of the TRTs of the invention, such as hTRT. For example, the hTRT coding 
sequence can be cloned into the Ecdysone-Inducible Expression System from Invitrogen (San 
Diego, CA) and the Tet-On and Tet-ofF tetracycline regulated systems from Clontech 
Laboratories, Inc. (Palo Alto, CA). Such inducible expression systems are provided for use in 
the methods of the invention where it is important to control the level or rate of transcription 
30 of transfected TRT. For example, the invention provides for cell lines immortalized through 
the expression of hTRT; such cells can be rendered "mortal" by inhibition of hTRT 
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expression by the vector through transcriptional controls, such as those provided by the 
Tet-Off system. The invention also provides for methods of expressing TRT only transiently 
to avoid the constitutive expression of hTRT, which may lead to unwanted "immortalization" 
of the transfected cells, as discussed above. 
5 The Ecdysone-Inducible Mammalian Expression System is designed to allow 

regulated expression of the gene of interest in mammalian cells. The system is distinguished 
by its tightly regulated mechanism that allows almost no detectable basal expression and 
greater than 200-fold inducibility in mammalian cells. The expression system is based on the 
heterodimeric ecdysone receptor of Drosophila. The Ecdysone-Inducible Expression System 

1 0 uses a steroid hormone ecdysone analog, muristerone A, to activate expression of hTRT via a 
heterodimeric nuclear receptor. Expression levels have been reported to exceed 200-fold over 
basal levels with no effect on mammalian cell physiology "Ecdysone-Inducible Gene 
Expression in Mammalian Cells and Transgenic Mice" (1996) Proc. Natl Acad. Scl USA 93, 
3346-3351). Once the receptor binds ecdysone or muristerone, an analog of ecdysone, the 

1 5 receptor activates an ecdysone-responsive promoter to give controlled expression of the gene 
of interest. In the Ecdysone-Inducible Mammalian Expression System, both monomers of the 
heterodimeric receptor are constitutively expressed from the same vector, pVgRXR. The 
ecdysone-responsive promoter, which ultimately drives expression of the gene of interest, is 
located on a second vector, pIND, which drives the transcription of the gene of interest. 

20 The hTRT coding sequence is cloned in the pIND vector (Clontech 

Laboratories, Inc, Palo Alto, CA), which contains 5 modified ecdysone response elements 
(E/GREs) upstream of a minimal heat shock promoter and the multiple cloning site. The 
construct is then transfected in cell lines which have been pre-engineered to stably express the 
ecdysone receptor. After transfection, cells are treated with muristerone A to induce 

25 intracellular expression from pIND. 

The Tet-on and Tet-off expression systems (Clontech, Palo Alto, CA) give 
access to the regulated, high-level gene expression systems described by Gosscn (1992) 
"Tight control of gene expression in mammalian cells by tetracycline responsive promoters" 
Proc. Natl Acad Set USA 89:5547-5551, for the Tet-Off transcription repression system; 

30 and Gossen (1995) "Transcriptional activation by tetracycline in mammalian cells" Science 
268: 1766-1769, for the Tet-On inducible transcriptional system. In "Tet-OfT transformed 
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cell lines, gene expression is turned on when tetracycline (Tc) or doxycycline ("Dox;" a Tc 
derivative) is removed from the culture medium. In contrast, expression is turned on in 
Tet-On cell lines by the addition of Tc or Dox to the medium. Both systems permit 
expression of cloned genes to be regulated closely in response to varying concentrations of Tc 
or Dox. 

This system uses the "pTRE" as a response plasmid that can be used to 
express a gene of interest. Plasmid pTRE contains a multiple cloning site (MCS) 
immediately downstream of the Tet-responsive PhCMV*-l promoter. Genes or cDNAs of 
interest inserted into one of the sites in the MCS will be responsive to the tTA and rtTA 
regulatory proteins in the Tet-Off and Tet-On systems, respectively. PhCMV*-l contains the 
Tet-responsive element (TRE), which consists of seven copies of the 42-bp tet operator 
sequence (tetO). The TRE element is just upstream of the minimal CMV promoter 
(PminCMV), which lacks the enhancer that is part of the complete CMV promoter in the pTet 
plasmids. Consequently, PhCMV*-l is silent in the absence of binding of regulatory proteins 
to the tetO sequences. The cloned insert must have an initiation codon. In some cases, 
addition of a Kozak consensus ribosome binding site may improve expression levels; 
however, many cDNAs have been efficiently expressed in Tet systems without the aiddition of 
a Kozak sequence. pTRE-Gene X plasmids are cotransfected with pTK-Hyg to permit 
selection of stable transfectants. 

Setting up a Tet-Off or Tet-On expression system generally requires two 
consecutive stable transfections to create a "double-stable" cell line that contains integrated 
copies of genes encoding the appropriate regulatory protein and TRT under the control of a 
TRE. In the first transfection, the appropriate regulatory protein is introduced into the cell line 
of choice by transfection of a "regulator plasmid" such as pTet-Off or pTet-On vector, which 
expresses the appropriate regulatory proteins. The hTRT cloned in the pTRE "response 
plasmid" is then introduced in the second transfection to create the double-stable Tet-Off or 
Tet-On cell line. Both systems give very tight on/off control of gene expression, regulated 
dose-dependent induction, and high absolute levels of gene expression. 

Expression Recombinant hTRT With DHFR and Adenovirus Sequences 

The pGRN155 plasmid construct was designed for transient expression of 
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hTRT cDNA in mammalian cells. A Kozak consensus is inserted at the 5' end of the hTRT 
sequence. The hTRT insert contains no 3' or 5' UTR. The hTRT cDNA is inserted into the 
EcoRI site of p91023(B) (Wong (1985) Science 228:810-815). The hTRT insert is in the 
same orientation as the DHFR ORF. 
5 Plasmid pGRN155 contains the SV40 origin and enhancer just upstream of an 

adenovirus promoter, a tetracycline resistance gene, an E. coli origin and an adenovirus VAI 
and VAII gene region. This expression cassette contains, in the following order: the 
adenovirus major late promoter; the adenovirus tripartite leader; a hybrid intron consisting of 
a 5 1 splice site from the first exon of the tripartite leader and a 3 1 splice site from the mouse 

1 0 immunoglobulin gene; the hTRT cDNA; the mouse DHFR coding sequence; and, the SV40 
polyadenylation signal. 

The adenovirus tripartite leader and the VA RNAs have been reported to 
increase the efficiency with which polycistronic mRNAs are translated. DHFR sequences 
have been reported to enhance the stability of hybrid mRNA. DHFR sequences also can 

1 5 provide a marker for selection and amplification of vector sequences. See Logan ( 1 984) 

Proc. Natl Acad ScL USA 81:3655); Kaufman (1985) Proc. Natl. Acad. Sci. USA 82: 689 ; 
and Kaufman (1988) Focus (Life Technologies, Inc.), Vol.10, no. 3). This makes the 
expression vector particularly useful for transient expression. 

20 

Other expression plamids of the invention are described for illustrative 

purposes. 



25 pGRN121 

The EcoRI fragment from lambda clone 25-1 .1 .6 containing the entire cDNA encoding hTRT 
protein was inserted into the EcoRI site of pBluescriptIISK+ such that the 5' end of the cDNA 
is near the T7 promoter in the vector. The selectable marker that is used with this vector is 
ampicillin. 

30 

pGRN122 
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The NotI fragment from pGRN121 containing the hTRT ORF was inserted into the NotI site 
of pEB VHisA so that the coding sequence is operably linked to the RSV promoter. This 
plasmid expresses a fusion protein composed of a His6 flag fused to the N-terminal of the 
hTRT protein. The selectable marker that is used with this vector is ampicillin or 
5 hygromycin. 

pGRN123 

The NotI fragment from pGRN121 containing the hTRT ORF was inserted into the NotI site 
of pEB VHisA so that the coding sequence is in the opposite orientation as the RSV promoter, 
1 0 thus expressing antisense hTRT. 

pGRN124 

Plasmid pGRN121 was deleted of all Apal sites followed by deletion of the MscI-HincII 
fragment containing the 3TJTR. The Nco-Xbal fragment containing the stop codon of the 
15 hTRT coding sequence was then inserted into the Nco-Xbal sites of pGRN121 to make a 
plasmid equivalent to pGRN121 except lacking the 3'UTR, which may be preferred for 
increased expression levels in some cells. 

pGRN125 

20 The NotI fragment from pGRN124 containing the hTRT coding sequence was inserted into 
the NotI site of pBBS235 so that the open reading frame is in the opposite orientation of the 
Lac promoter. The selectable marker that is used with this vector is chloramphenicol. 

pGRN126 

25 The NotI fragment from pGRN124 containing the hTRT coding sequence was inserted into 
the NotI site of pBBS235 so that the hTRT coding sequence inserted is in the same 
orientation as the Lac promoter. 

pGRN127 

30 The oligonucleotide 5-TGCGCACGTGGGAAGCCCTGGCagatctgAattCCaCcATGC 

CGCGCGCTCCCCGCTG-3 , was used in in vitro mutagenesis of pGRN125 to convert the 
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initiating ATG codon of the hTRT coding sequence into a Kozak consensus sequence and 
create EcoRI and Bglll sites for cloning. Also, oligonucleotide COD2866 was used to 
convert AmpS to AmpR (ampicillin resistant) and oligonucleotide COD 1941 was used to 
convert CatR (chloramphenicol resistant) to CatS (chloramphenicol sensitive). 

5 

pGRN128 

The oligonucleotide 5 f -TGCGCACGTGGGAAGCCCTGGCagatctgAattCCaCcATG 
CCGCGCGCTCCCCGCTG-3' is used in in vitro mutagenesis to convert the initiating ATG 
codon of hTRT into a Kozak consensus and create EcoRI and Bglll sites for cloning. Also, 
10 oligo S'-CTGCCCTCAGACTTCAAGACCATCCTGGACTACAA 

GGACGACGATGACAAATGAATTCAGATCTGCGGCCGCCACCGCGGTGGAGCTCC 
AGC-3* is used to insert the IBI Flag (International Biotechnologies Inc. (IBI), Kodak, New 
Haven, CT) at the C-terminus and create EcoRI and Bglll sites for cloning. Also, COD2866 
is used to convert AmpS to AmpR and COD1941 is used to convert CatR to CatS. 

15 

pGRN129 

The oligonucleotide S'-CGGGACGGGCTGCTCCTGCGTTTGGTGGAcGcgTTCTTG 
TTGGTGACACCTCACCTCACC-3 , was used by in vitro mutagenesis to convert Asp869 to 
an Ala codon (i.e. the second Asp of the DD motif was converted to an Alanine to create a 
20 dominant/negative hTRT mutant). This also created a Mlul site. Also, oligonucleotide 5- 
CTGCCCTCAGACTTCAAGACCATCCTGGACTACAAGG 

ACGACGATGACAAATGAATTCAGATCTGCGGCCGCCACCGCGGTGGAGCTCCAG 
C-3') was used to insert the IBI Flag at the C-terminus and create EcoRI and Bglll sites for 
cloning. Also, COD2866 was used to convert AmpS to AmpR and COD 1941 was used to 
25 convert CatR to CatS. 

pGRN130 

The oligonucleotide 5'-CGGGACGGGCTGCTCCTGCGTTTGGTGGAcGcgTTCTT 
GTTGGTGACACCTCACCTCACC-3' was used in in vitro mutagenesis to convert the 
30 Asp869 codon into an Ala codon (i.e. the second Asp of the DD motif was converted to an 
Alanine to make a dominant/negative variant protein). This also created an Mlul site. Also, 
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the oligonucleotide 5'-TGCGCACGTGGGAAGCCCTGGCagatctgAatt 
CCaCcATGCCGCGCGCTCCCCGCTG-3' was used in in vitro mutagenesis to convert the 
initiating ATG codon of the hTRT coding sequence into a Kozak consensus sequence and 
create EcoRI and Bglll sites for cloning. Also, COD2866 was used to convert AmpS to 
AmpR and COD1941 was used to convert CatR. 

pGRN131 

The EcoRI fragment from pGRN128 containing the hTRT ORF with Kozak sequence and IBI 
Flag mutations is inserted into the EcoRI site of pBBS212 so that the hTRT ORF is expressed 
off the MPSV promoter. Plasmid pBSS212 contains a MPSV promoter, the CMV enhancer, 
and the SV40 polyadenylation site. 

pGRN132 

The EcoRI fragment from pGRN128 containing the hTRT ORF with Kozak sequence and IBI 
Flag mutations is inserted into the EcoRI site of pBBS212 so that the antisense of the hTRT 
ORF is expressed off the MPSV promoter. 



pGRN133 

The EcoRI fragment from pGRN121 containing the hTRT coding sequence was inserted into 
the EcoRI site of pBBS212 so that the hTRT protein is expressed under the control of the 
MPSV promoter. 

pGRN134 

The EcoRI fragment from pGRN121 containing the hTRT coding sequence was inserted into 
the EcoRI site of pBBS212 so that the antisense of the hTRT coding sequence is expressed 
under the control of the MPSV promoter. The selectable markers used with this vector are 
Chlor/HygB/PAC. 

pGRN135 

Plasmid pGRN126 was digested to completion with MscI and Smal and religated to delete 
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over 95% of the hTRT coding sequence inserted. One Smal-MscI fragment was re-inserted 
during the process to recreate the Cat activity for selection. This unpurified plasmid was then 
redigested with Sail and EcoRI and the fragment containing the initiating codon of the hTRT 
coding sequence was inserted into the Sall-EcoRI sites of pBBS212. This makes an antisense 
5 expression plasmid expressing the antisense of the 5'UTR and 73 bases of the coding 
sequence. The selectable markers used with this vector are Chlor/HygB/PAC. 

pGRN136 

The Hindlll-Sall fragment from pGRN126 containing the hTRT coding sequence was 
1 0 inserted into the Hindlll-Sall sites of pBBS242. 

pGRN137 

The SalI-Sse8387I fragment from pGRN130 containing the Kozak sequence was inserted into 
the SalI-Sse8387I sites of pGRN136. 

15 

pGRN138 

The EcoRI fragment from pGRN124 containing hTRT minus the 3'UTR was inserted into the 
EcoRI site of pEGFP-C2 such that the orientation of the hTRT is the same as the EGFP 
domain. 

20 

pGRN139 

The oligonucleotide 5'- CTGCCCTCAGACTTCAAGACCATCCTGGACTACAAGG 
ACGACGATGACAAATGAATTCAGATCTGCGGCCGCCACCGCGGTGGAGCTCCAG 
C-3 was used to insert the IBI Flag at the C-terminus of hTRT in pGRN125 and crea te EcoRI 
25 and Bglll sites for cloning. Also, COD2866 was used to convert AmpS to AmpR) and 
COD 1941 was used to convert CatR to CatS. 

pGRN140 

The Ncol fragment containing the upstream sequences of genomic hTRT and the first intron 
30 of hTRT from lambdaG55 was inserted into the Ncol site of pBBS167. The fragment is 
oriented so that hTRT is in the same direction as the Lac promoter. 

251 



\ i 'i hi i mi ii i 1 i ii in 11 r 



# • 

pGKN141 

The Ncol fragment containing the upstream sequences of genomic hTRT and the first intron 
of hTRT from lambdaG55 was inserted into the Ncol site of pBBS167. The fragment is 
oriented so that hTRT is in the opposite direction as the Lac promoter. 

5 

pGRN142 

The NotI fragment from lambdaGphiS containing the complete -15 kbp genomic insert 
including the hTRT gene promoter region was inserted in the NotI site of plasmid pBBS185. 
The fragment is oriented so that the hTRT ORF is in the opposite orientation as the Lac 
10 promoter. 

pGRN143 

The NotI fragment from lambdaGphiS containing the complete -15 kbp genomic insert 
including the hTRT gene promoter region was inserted in the NotI site of plasmid pBBS185. 
15 The fragment is oriented so that the hTRT ORF is in the same orientation as the Lac 
promoter. 

pGRN144 

SAL1 deletion of pGRN140 to remove lambda sequences. 

20 

pGRN145 

This vector was constructed for the expression of hTRT sequences in mammalian cells. The 
EcoRI fragment from pGRN137 containing the hTRT coding sequence was inserted into the 
EcoRI site of pBBS212 to remove the portion of the sequence corresponding to the 5 f UTR of 
25 hTRT mRNA. The hTRT coding sequence is oriented so that it is expressed under the 
control of the MPSV promoter. The selectable markers used with this vector are 
Chlor/HygB/PAC. 

pGRN146 

30 This vector was constructed for the expression of hTRT sequences in mammalian cells. The 
Sse8387I-NotI fragment from pGRN130 containing the D869A mutation of hTRT was 
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inserted into the Sse8387I-NotI sites of pGRN137. The selectable markers used with this 
vector are Ampicillin/HygB/PAC. 

pGRN147 

5 The Sse8387I-NotI fragment from pGRN139 containing the IBI Flag was inserted into the 
Sse8387I-NotI sites of pGRN137. 

pGRN148 

The BglII-Eco47III fragment from pGRN144 containing the promoter region of hTELT was 
10 inserted into the Bglll-Nrul sites of pSEAP2 to make an hTRT promoter/reporter construct. 

pGRN149 

This vector is an intermediate vector for constructing a hTRT fusion protein expression 
vector. The mutagenic oligo 5 ! -cttcaagaccatcctggactttcgaaacgcggccgccaccg 
15 cggtggagctcc-3 1 was used to add a CSP45I site at the C-terminus of hTRT by in vitro 
mutagenesis of pGRN125. The "stop" codon of hTRT was deleted and replaced with a 
Csp45I site. The selectable marker that is used with this vector is ampicillin. 

pGRNlSO 

20 The Bglll-Fspl fragment from pGRN144 containing the promoter region of hTRT was 

inserted into the Bglll-Nrul sites of pSEAP2 to make an hTRT promoter/reporter construct. 

pGRN151 

This vector was constructed for the expression of hTRT sequences in mammalian cells. The 
25 EcoRI fragment from pGRN147 containing the hTRT coding sequence was inserted into the 
EcoRI site of pBBS212 to remove the portion of the sequence corresponding to the 5'UTR of 
the hTRT mRNA. The hTRT coding sequence is oriented so that it is expressed under the 
control of the MPSV promoter. The selectable markers used with this vector are 
Chlor/HygB/PAC. 

30 

pGRN152 
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The EcoRI fragment from pGRN146 containing the hTRT coding sequence was inserted into 
the EcoRI site of pBBS212 to remove the portion of the sequence corresponding to the 
5'UTR of the hTRT. The hTRT coding sequence is oriented so that it is expressed under the 
control of the MPSV promoter. 

5 

pGRN153 

The Styl fragment from pGRN130 containing the D869-->A mutation of hTRT (hTRT 
variant coding sequence) was inserted into the Styl sites of pGRN158 to make a plasmid 
containing the hTRT coding sequence with a Kozak consensus sequence at its 5'-end, an IBI 
10 FLAG sequence at its 3'-end (the C-terminus encoding region), and the D869-->A mutation. 



pGRN154 

The EcoRI fragment of pGRN153 containing the hTRT gene was inserted into the EcoRI site 
of plasmid pBS212 in an orientation such that the hTRT ORF is oriented in the same 
15 direction as the MPSV promoter. This makes an MPSV-directed expression plasmid that 

expresses the hTRT protein with a Kozak consensus sequence at its amino-terminal end, an 
IBI FLAG at its carboxy-terminal end, and the D869->A mutation 



pGRN155 

20 This vector was constructed for the expression of hTRT sequences in mammalian cells. The 
insert included full length cDNA of hTRT minus 5' and 3' UTR, and Kozak sequences. The 
EcoRI fragment from pGRN145 containing the hTRT cDNA with the Kozak consensus and 
no 3' or 5 f UTR was inserted into the EcoRI site of p91023(B) such that the hTRT is in the 
same orientation as the DHFR ORF. This makes a transient expression vector for hTRT. The 

25 selectable marker used with this vector is tetracycline. 



pGRN156 

This vector was constructed for the expression of hTRT sequences in mammalian cells. The 
EcoRI fragment from pGRN146 containing the D869A mutation of the hTRT cDNA with the 
30 Kozak consensus and no 3 f or 5' UTR was inserted into the EcoRI site of p91023(B) such that 
the hTRT is in the same orientation as the DHFR ORF. This makes a transient expression 
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vector for hTRT. The insert included full length cDNA of hTRT minus 5' and 3' UTR, 
D869A, and Kozak sequences. The selectable marker used with this vector is tetracycline. 



pGRN157 

5 This vector was constructed for the expression of hTRT sequences in mammalian cells. The 
EcoRI fragment from pGRN147 containing the hTRT cDNA with the IBI FLAG at the C- 
terminus; the Kozak consensus and no 3' or 5 f UTR into the EcoRI site of p91023(B) such 
that the hTRT is in the same orientation as the DHFR ORF. This makes a transient expression 
vector for hTRT. The insert included full length cDNA of hTRT minus 5* and 3 f UTR, the IBI 

10 FLAG sequence, and Kozak sequences. The selectable marker used with this vector is 
tetracycline. 

pGRN158 

This vector was constructed for the expression and mutagenesis of TRT sequences in E. coll 
1 5 The EcoRI fragment from pGRNl 5 1 containing the hTRT ORF was inserted into the EcoRI 
site of pBBS183 so that the hTRT ORF is oriented in the opposite direction as the Lac 
promoter. The insert included full length cDNA of hTRT minus 5' and 3' UTR, IBI FLAG 
sequence, and Kozak sequences. The hTRT coding sequence is driven by a T7 promoter. 
The selectable marker used with this vector is amphicillin. 

20 

pGRN159 

This vector was constructed for the expression and mutagenesis of TRT sequences in E, coli. 
The Nhel-Kpnl fragment from pGRN138 containing the EGFP to hTRT fusion was inserted 
into the Xbal-Kpnl sites of pBluescriptIIKS+. This makes a T7 expression vector for the 
25 fusion protein (the coding sequence is driven by a T7 promoter). The insert included full 
length cDNA of hTRT minus the 3' UTR as a fusion protein with EGFP. The selectable 
marker used with this vector is amphicillin. 

pGRN160 

30 This vector was constructed for the expression of antisense hTR sequences in mammalian 
cells. The coding sequence is operably linked to an MPSV promoter. The Xhol-Nsil 
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fragment from pGRN90 containing the full length hTR ORF was inserted into the Sall- 
Sse8387I sites of pBBS295. This makes a transient/stable vector expressing hTR antisense 
RNA. A GPT marker was incorporated into the vector. The selectable markers used with 
this vector are Chlor/gpt/PAC. 

5 

pGRN161 

This vector was constructed for the expression of sense hTR sequences in mammalian cells. 
The Xhol-Nnil fragment from pGRN89 containing the full length hTR ORF was inserted into 
the SalI-Sse8387I sites of pBBS295. This makes a transient/stable vector expressing hTR in 
10 the sense orientation. The coding sequence is driven by an MPSV promoter. A GPT marker 
was incorporated into the vector. The selectable markers used with this vector are 
Chlor/gpt/PAC. 

pGRN162 

15 The Xhol-Nsil fragment from pGRN87 containing the full length hTR ORF was inserted into 
the SalI-Sse8387I sites of pBBS295. This makes a transient/stable vector expressing 
truncated hTR (from position +108 to 4-435) in the sense orientation. 

pGRN163 

20 This vector was constructed for the expression and mutagenesis of TRT sequences in E, coli. 
The coding sequence is driven by a T7 promoter. Oligonucleotide RA45 
(S'-GCCACCCCCGCGCTGCCTCGAGCTCCCCGCTGC-S') is used in in vitro mutagenesis 
to change the initiating met in hTRT to Leu and introduce an Xhol site in the next two codons 
after the Leu. Also COD 1941 was used to change CatR to CatS, and introduces a BSPH1 

25 site, and COD 2866 was used to change AmpS to AmpR, introducing an FSP1 site. The 
selectable marker used with this vector is amphicillin. 

pGRN164 

This vector was constructed for the expression of hTR sequences in E. coli. Primers hTR+1 
30 5-GGGGAAGCTTTAATACGACTCACTATAGGGTTGCGGAGGGTGG 

GCCTG-3' andhTR+445 S'-CCCCGGATCCTGCGCATGTGTGAGCCGAGTCCT 
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GGG-3'were used to amplify by PCR a fragment from pGRN33 containing the full length 
hTR with the T7 promoter on the 5' end (as in hTR+1). A BamHI-Hindlll digest of the PCR 
product was put into the BamHI-Hindlll sites of pUCl 19. The coding sequence operably 
linked to a T7 promoter. The selectable marker used with this vector is amphicillin. 
5 pGRN164 is also called phTR+1 . 

pGRN165 

This vector was constructed for the expression and mutagenesis of hTRT sequences in E. coll 
The coding sequence is operably linked to a T7 promoter. The EcoRI fragment from: 
1 0 pGRN 1 45 containing the hTRT ORF with a Kozak front end was inserted into the EcoRI site 
of pBluescriptIISK+ so that the hTRT is oriented in the same direction as the T7 promoter. 
The selectable marker used with this vector is amphicillin. 

pGRN166 

1 5 This vector was constructed for the expression and mutagenesis of TRT sequences in 

mammalian cells. The coding sequence is operably linked to a T7 promoter. The EcoRI 
fragment from pGRNl 5 1 containing the hTRT ORF with a Kozak front end and IBI flag at 
the back end was inserted into the EcoRI site of pBluescriptIISK+ so that the hTRT ORF is 
oriented in the same direction as the T7 promoter. The insert included full length cDNA of 

20 hTRT minus 5' and 3' UTR, FLAG sequence (Immunex Corp, Seattle WA), and Kozak 
sequences. The selectable marker used with this vector is amphicillin. 

pGRN167 

AvRII-StuI fragment from pGRN144 containing the 5' end of the hTRT ORF was inserted 
25 into the Xbal-StuI sites of pBBS161. 

pGRN168 

The EcoRI fragment from pGRN145 containing the optimized hTRT expression cassette was 
inserted into the EcoRI site of pIND such that the hTRT coding sequence is in the same 
30 orientation as the miniCMV promoter. 
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pGRN169 

The EcoRI fragment from pGRN145 containing the optimized hTRT expression cassette was 
inserted into the EcoRI site of pIND such that the hTRT is in the reverse orientation from the 
miniCMV promoter. 

5 

pGRN170 

The EcoRI fragment from pGRN145 containing the optimized hTRT expression cassette was 
inserted into the EcoRI site of pIND(spl) such that the hTRT is in the opposite orientation 
from the miniCMV promoter. 

10 

pGRN171 

The Eco47III-NarI fragment from pGRN163 was inserted into the Eco47III-NarI sites of 
pGRN167, putting the MIL mutation into a fragment of the hTRT genomic DNA. 

15 pGRN172 

The BamHI-StuI fragment from pGRN171 containing the Met to Leu mutation in the hTRT 
ORF was inserted into the Bglll-Nrul sites of pSEAP2-Basic. 

pGRN173 

20 The EcoRV-EC047III fragment from pGRN 1 44 containing the 5' end of the hTRT 
promoter region was inserted into the SrfI-Eco47III sites of pGRN172. This makes a 
promoter reporter plasmid that contains the promoter region of hTRT from approximately 2.3 
kb upstream from the start of the hTRT ORF to just after the first intron in the coding region, 
with the Metl— >Leu mutation. 

25 

pGRN174 

The EcoRI fragment from pGRN145 containing the "optimized" hTRT expression cassette 
was inserted into the EcoRI site of pIND(spl) such that the hTRT is in the same orientation 
as the miniCMV promoter. 

30 
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EXAMPLE 7 
RFCONSTITUTTON OF TELOMERASE ACTIVITY 

A. Co-Expression of hTRT and hTR in vitro 

In this example, the coexpression of hTRT and hTR vising an in vitro cell-free 
5 expression system is described. These results demonstrate that the hTRT polypeptide 
encoded by pGRN121 encodes a catalytically active telomerase protein and that in vitro 
reconstitution (IVR) of the telomerase RNP can be accomplished using recombinantly 
expressed hTRT and hTR. 

Telomerase activity was reconstituted by adding linearized plasmids of hTRT 
10 (pGRN121 ; 1 ug DNA digested with Xba I) and hTR (phTR+1 ; 1 ug digested with Fspl) to a 
coupled transcription-translation reticulocyte lysate system (Promega TNT™). phTR+1 is a 
plasmid which, when linearized with Fspl and then transcribed by T7 RNA polymerase, 
generates a 445 nucleotide transcript beginning with nucleotide +1 and extending to 
nucleotide 446 of hTR (Autexier et al., 1996, EMBO J 15:5928). For a 50 ul reaction the 
1 5 following components were added: 2 ul TNT™ buffer, 1 ul TNT™ T7 RNA polymerase, 1 
ul 1 mM amino acid mixture, 40 units Rnasin™ RNase inhibitor, 1 ug each linearized 
template DNA, and 25 ul TNT™ reticulocyte lysate. Components were added in the ratio 
recommended by the manufacturer and were incubated for 90 min at 30°C. Transcription was 
under the direction of the T7 promoter and could also be carried out prior to the addition of 
20 reticulocyte lysate with similar results. After incubation, 5 and 10 ul of the programmed 
transcription-translation reaction were assayed for telomerase activity by TRAP as 
previously described (Autexier et al., supra) using 20 cycles of PCR to amplify the signal. 

The results of the reconstitution are shown in Figure 10. For each 
transcription/translation reaction assayed there are 3 lanes: The first 2 lanes are duplicate 
25 assays and the third lane is a duplicate sample heat denatured (95 ° C, 5 min) prior to the 
TRAP phase to rule out PCR generated artifacts. 

As shown in Figure 10, reticulocyte lysate alone has no detectable telomerase 
activity (lane 6). Similarly, no detectable activity is observed when either hTR alone (lane 1) 
or full length hTRT gene (lane 4) are added to the lysate. When both components aire added 
30 (lane 2), telomerase activity is generated as demonstrated by the characteristic repeat ladder 
pattern. When the carboxyl-terminal region of the hTRT gene is removed by digestion of the 
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vector with Ncol ("truncated hTRT") telomerase activity is abolished (lane 3). Lane 5 shows 
that translation of the truncated hTRT alone does not generate telomerase activity. Lane "R8" 
shows a positive control for a telomerase product ladder generated by TRAP of TSR8, a 
synthetic telomerase product having a nucleotide sequence of 
5 5 , -ATTCCGTCGAGCAGAGTTAG[GGTTAG] 7 -3 f , 

It was also observed that purification of IVR telomerase resulted in a stronger 
signal and/or reduced background in certain telomerase activity assays. In some experiments, 
IVR telomerase activity from co-synthesized components was enriched by fractionation of 
TnT reactions over DEAE anion exchange membraues (Millipore Ultrafree-MC): 200 \x\ of 

10 the hTRT/hTR TnT reaction was passed through a single DEAE membrane. The membrane 
was washed with 400 \x\ of 0.2 M NaCl in buffer A (20 mM HEPES-KOH pH 7.9, 2 mM 
MgCl 2 , 1 mM EGTA, 10% glycerol, 0.1% Nonidet P-40, 0.1 mM phenylmethylsulfonyl 
fluoride) and IVR telomerase was eluted from the membrane with 80 jjiI of 1 M NaCl in 
buffer A. Alternatively, batch chromatography was used: 400 \i\ of the TnT reaction was 

1 5 partially purified by batch chromatography using 25 nl of Toso-Haas Q-650M resin. After 
binding telomerase to the resin, it was washed with 0.1 M NaCl in buffer A, followed by a 
second wash with 0. 1 8 M NaCl in buffer A and eluted with 1 00 \i\ of 0.3 M NaCl in buffer A. 

B. Mixing of hTRT and hTR in vitro 

20 In vitro reconstitution of telomerase activity was also accomplished by mixing. 

hTRT was transcribed and translated as described supra, but without the addition of the hTR 
plasmid. Reconstitution of the telomerase RNP was then accomplished by mixing the hTRT 
translation mixture with hTR (previously generated by T7 RNA polymerase transcription 
from phTR+l-Fsp) in the ratio of 2 \il of hTRT translation mix to 2 \i\ of hTR (1 ug) then 

25 incubated for 90 minutes at 30° C. The reaction conditions were adjusted to a KC1 

concentration of about 0.2 M. (The presence of KC1 at a concentration of about 0.1 M to 
about 1 .0 M may enhance telomerase activity or telomerase reconstitution in IVR). This 
method of hTRT/hTR reconstitution is referred to as "linked reconstitution" or "linked IVR." 
Telomerase activity is present (i.e., can be detected) in this mixture. Improved signal was 

30 observed following partial purification of the activity by DEAE chromatography. In this case 
Millipore Ultrafree-MC DEAE Centrifugal Filter Devices were used according to the 
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manufacturer's directions). The buffers used were hypoO. 1 , hypo0.2, and hypol .0, where 
hypo is 20 mM Hepes-KOH, pH 7.9, 2 mM MgC12, 1 mM EGTA, 10 % glycerol, 0.1 % 
NP-40, 1 mM DTT, 1 mM Na-metabisulfite, 1 mM benzamidine, and 0.2 mM 
phenylmethylsulfonylflouride (PMSF), and where 0.1, 0.2 and 1.0 refers to 0.1, 0.2 or 1.0 M 
5 KCL. The filters were pre-conditioned with hypol.0, washed with hypoO.l, the reconstituted 
telomerase was loaded, the column was washed with hypoO.l then hypo0.2, and the 
reconstituted telomerase was eluted with hypol .0 at half the volume as was loaded. This 
formulation could be stored frozen at -70°C and retains activity. 

Telomerase activity was assayed in a two step procedure. In step one, a 
10 conventional telomerase assay was performed as described in Morin, 1989, Cell 59: 521, 
except no radiolabel was used. In step two, an aliquot was assayed by the TRAP procedure 
for 20-30 cycles as described supra. The conventional assay was performed by assaying 1-10 
ul of reconstituted telomerase in 40-50 ul final volume of 25 mM Tris-HCl, pH 8.3, 50 mM 
K-acetate, 1 mM EGTA, 1 mM MgC12, 2 mM dATP, 2 mM TTP, 10 uM dGTP, and 1 uM 
1 5 primer (usually M2, S'-AATCCGTCGAGCAGAGTT) at 30 ° C for 60-1 80 minutes. The 
reaction was stopped by heating to 95° C for 5 minutes and 1-10 ul of the first step mixture 
was carried onto the step two TRAP reaction (50 ul). 

In additional experiments, the synthesis of hTRT and hTR during in vitro 
reconstitution was monitored by 35 S-methionine incorporation and Northern blotting, 
20 respectively. Proteins of approximately the predicted size were synthesized for hTRT (1 27 
kD), hTRT-Nco (85 kD), and pro90hTRT (90 kD) in approximately equal molar amounts 
relative to each other. The Northern analysis indicated hTR synthesis was the correct size 
(445 nucleotides) and predominantly intact. 

High levels of reconstitution and telomerase activity were also obtained with 2 
25 ug of linearized pGRN121 in a 50 ul TnT reaction as described supra (Example 7A) except 
that in place of the hTR template, 4 pmol (0.6 ug) of hTR RNA (previously generated by T7 
RNA polymerase transcription from phTR+l-Fsp) was added at the beginning of the TnT 
reaction and the reaction was incubated at 30°C for 90-120 minutes. Slightly greater (2-5 
times) activity was achieved using 1 ug of supercoiled XhTRT-E and 16 pmol (2.4 ug) of 
30 pre-synthesized hTR RNA set up in a 50 ul TnT reaction, as described supra, with incubation 
at 30°C for 90-120 minutes. XhTRT-E is an hTRT construct in the pcDNA3.1/His Xpress 
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vector (Invitrogen) in which an optimized ribosome recognition site (Kozak consensus), six 
histidine residues, and an epitope tag are fused with the hTRT open reading frame. 

Variations of the reconstitution protocols, supra, will be apparent to those of 
skill. For example, the time and temperature of reconstitution, and presence or concentration 

5 of components such as monovalent salt (e.g.. NaCl, KC1, potassium acetate, potassium 
glutamate, and the like), divalent salt (MgCl 2 , MnCl 2 , MgS0 4 , and the like), denaturants 
(urea, formamide, and the like), detergents (NP-40, Tween, CHAPS, and the like), and 
alternative improved purification procedures (such as immunoprecipitation, affinity or 
standard chromatography) can be employed. These and other parameters can be varied in a 

1 0 systematic way to optimize conditions for particular assays or other reconstitution protocols. 

C. Reconstitution Using hTRT Variants and Fusion Proteins 

Reconstitution of telomerase catalytic activity occurred when EGFP-hTRT, a 
fusion of the enhanced green fluorescent protein to hTRT (see Examples 6 and 15), or 
15 epitope-tagged hTRT (IBI FLAG, see Example 6) both reconstituted telomerase activity at 
approximately wild-type levels were coexpressed with hTR. 

In contrast, telomerase activity was not reconstituted when a variant hTRT, 
pro90hTRT (missing RT motifs B', C, D, and E) was used. This demonstrates that 
pro90hTRT does not possess full telomerase catalytic activity, although it may have other 
20 partial activities (e.g., RNA [i.e. hTR] binding ability and function as dominant-negative 
regulator of telomerase in vivo as described supra). 

D. Assay of in vitro Reconstituted Telomerase Activity Using the Gel Blot and 
Conventional Telomerase Assay 

25 The following example demonstrates that in vitro reconstituted (IVR) 

telomerase can be assayed using conventional telomerase assays in addition to 
amplification-based assays (i.e., TRAP). IVR telomerase as described in part (B), supra (the 
"linked reconstitution method") followed by DEAE purification, as described supra was 
assayed using the gel blot assay using the following reaction conditions; 1-10 ^il of linked 

30 IVR telomerase in 40 \x\ final volume of 25 mM Tris-HCl, pH 8.3, 50 mM K-acetate, 1 mM 
EGTA, 1 mM MgC12, 0.8 mM dATP, 0.8 mM TTP, 1.0 mM dGTP, and 1 uM primer ( M2, 
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supra; or H3.03, S'-TTAGGGTTAGGGTTAGGG) at 30°C for 180 minutes. The telomeric 
DNA synthesized was isolated by standard procedures, separated on a 8 % polyacrylamide, 8 
M urea gel, transfered to a nylon membrane, and probed using the 32 P-(CCCTAA)n riboprobe 
used in the dot-blot assay. The probe identified a six nucleotide ladder in the lane 
5 representing 10 \il of IVR telomerase that was equivalent to the ladder observed for 5 \x\ of 
native nuclear telomerase purified by mono Q and heparin chromatography. The results show 
that IVR telomerase possesses processive telomerase catalytic activity equivalent to native 
telomerase. 

Linked IVR telomerase was also assayed by the conventional 32 P-dGTP 
1 0 incorporation telomerase assay. IVR telomerase prepared by the linked reconstitution method 
followed by DEAE purification, as described above, was assayed under both processive and 
non-processive reaction conditions. Assay conditions were 5-10 |il of linked IVR telomerase 
in 40 \il final volume of 25 mM Tris-HCl, pH 8.3, 50 mM K-acetate, 1 mM EGTA, 1 mM 
MgC12, 2 mM dATP, 2 mM TTP, with 10 uM 32 P-dGTP (72 Ci/mmol) [for assay of 
15 processive conditions] or 1 uM 32 P-dGTP (720 Ci/mmol) [for non-processive], and 1 uM 
primer (i.e., H3.03, supra) at 30°C [for the processive reaction] or 37°C [for the 
non-processive reaction] for 180 minutes. The telomeric DNA synthesized was isolated by 
standard procedures and separated on a 8 % polyacrylamide, 8 M urea gel sequencing gel. 
The processive reaction showed a weak six nucleotide ladder consistent with a processive 
20 telomerase reaction, and the non-processive reaction added one repeat, a pattern equivalent to 
a control reaction with a native telomerase preparation. Conventional assays using IVR 
telomerase are useful in screens for telomerase modulators, as described herein, as well as 
other uses such as elucidation of the structural and functional properties of telomerase. 

25 E. In vitro Reconstituted Telomerase Recognizes Primer 3* Termini 

This experiment demonstrates that IVR telomerase recognizes primer 3' 
termini equivalently to native (purified) telomerase. Telomerase forms a base-paired duplex 
between the primer 3* end and the template region of hTR and adds the next specified 
nucleotide (Morin, 1989, supra). To verify that IVR (recombinant) telomerase has the same 
30 property, the reactions of primers with — GGG or — TAG 3 r termini 

(AATCCGTCGAGCAGAGGG and AATCCGTCGAGCAGATAG) were compared to a 
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primer having a — GTT 3' terminus (M2 supra) using IVR and native telomerase assayed by 
the two step conventional/TRAP assay detailed above. The product ladders of the — GGG 
and — TAG primers were shifted +4 and +2, respectively, when compared to the standard 
primer ( — GTT 3 ! end), the same effect as was observed with native telomerase. This 
experiment demonstrates IVR and native telomerases recognize primer termini in a similar 
manner. 

These results (along with the results supra showing that IVR telomerase 
possesses both processive and non-processive catalytic activity) indicate that IVR telomerase 
has similar structure and properties compared to native or purified telomerase. 



EXAMPLE 8 
PRODUCTION OF ANTI-hTRT ANTIBODIES 
A. Production of Anti-hTRT Antibodies Against hTRT Peptides 

To produce anti-hTRT antibodies, the following peptides from hTRT were 
1 5 synthesized with the addition of C (cysteine) as the amino terminal residue (see Figure 54). 
S-l : FFY VTE TTF QKN RLF FYR KSV WSK 
S-2: RQH LKR VQL RDV SEA EVR QHR EA 
S-3 : ART FRR EKR AER LTS RVK ALF S VL NYE 
A-3: PAL LTS RLR FIP KPD GLR PIV NMD YVV 
20 The cysteine moiety was used to immobilize (i.e., covalently link) the peptides to BSA and 
KLH [keyhole limpet hemocyanin] carrier proteins. The KLH-peptides were used as antigen. 
The BSA-peptide conjugates served as material for ELISAs for testing the specificity of 
immune antisera. 

The KLH-peptide conjugates were injected into New Zealand White rabbits. 

25 The initial injections are made by placing the injectant proximal to the axillary and inguinal 
lymph nodes. Subsequent injections were made intramuscularly. For initial injections, the 
antigen was emulsified with Freund's complete adjuvant; for subsequent injections, Freund's 
incomplete adjuvant was used. Rabbits follow a three week boost cycle, in which 50 ml of 
blood yielding 20-25 ml of serum is taken 10 days after each boost. Antisera against each of 

30 the four peptides recognized the hTRT moiety of recombinant hTRT fusion protein 
(GST-HIS 8 -hTRT-fragment 2426 to 3274); see Example 6) on western blots. 
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Using a partially purified telomerase fraction from human 293 cells 
(approximately 1000-fold purification compared to a crude nuclear extract) that was produced 
as described in PCT application No. 97/06012 and affinity purified anti-S-2 antibodies, a 130 
kd protein doublet could be detected on a western blot. A sensitive chemiluminescence 
detection method was employed (SuperSignal chemiluminescence substrates, Pierce) but the 
signal on the blot was weak, suggesting that hTRT is present in low or very low abundance in 
these immortal cells. The observation of a doublet is consistent with a post-translational 
modification of hTRT, i.e., phosphorylation or glycosylation. 

For affinity purification, the S-2 peptide was immobilized to SulfoLink 
(Pierce, Rockford IL) through its N-terminal Cysteine residue according to the manufacturer's 
protocol. First bleed serum from a rabbit immunized with the KLH-S-2 peptide antigen was 
loaded over a the S-2-SulfoLink and antibodies specifically bound to the S-2 peptide were 
eluted. 

B. Production of Anti-hTRT Antibodies Against hTRT Fusion Proteins 

GST-hTRT fusion proteins were expressed in E. coli as the GST-hTRT 
fragment #4 (nucleotides 3272-4177) and the GST-HIS8 -hTRT fragment #3 (nucleotides 
2426 to 3274) proteins described in Example 6. The fusion proteins were purified as 
insoluble protein, and the purity of the antigens was assayed by SDS polyacrylamide gels and 
estimated to be about 75% pure for the GST-hTRT fragment #4 recombinant protein and 
more than 75% pure for GST-HIS8 -hTRT fragment #3 recombinant protein. Routine 
methods may be used to obtain these and other fusion proteins at a purity of greater than 
90%. These recombinant proteins were used to immunize both rabbits and mice, as described 
above. 

The first and second bleeds from both the mice and rabbits were tested for the 
presence of anti-hTRT antibodies after removal of anti-GST antibodies using a matrix 
containing immobilized GST. The antisera were tested for anti-hTRT antibodies by Western 
blotting using immobilized recombinant GST-hTRT fusion protein, and by 
immunoprecipitation using partially purified native telomerase enzyme. While no signal was 
observed in these early bleeds, titers of anti-hTRT antibodies, as expected, increased in 
subsequent bleeds. 



265 



EXAMPLE 9 

DETECTION OF AN hTRT mRNA CORRESPONDING TO A182 RNA VARIANT 

Poly A + RNA from human testis and the 293 cell line was analyzed for hTRT 
mRNA using RT-PCR and nested primers. The first primer set was TCP 1 . 1 and TCP 1.15; 
the second primer set was TCP1.14 and BTCP6. Amplification from each gave two products 
differing by 182 bp; the larger and smaller products from testis RNA were sequenced and 
found to correspond exactly to pGRNl 21 (Figure 16) and the 712562 clone (Figure 18), 
respectively. The variant hTRT RNA product has been observed in mRNA from SW39i, 
OVCAR4, 293, and Testes. 

Additional experiments were carried out to demonstrate that the A182 cDNA 
was not an artifact of reverse transcription. Briefly, full-length hTRT RNA (i.e., without the 
deletion) was produced by in vitro transcription of pGRN121 for use as a template for RT- 
PCR. Separate cDNA synthesis reactions were carried out using Superscript® reverse 
transcriptase (Bethesda Research Laboratories, Bethesda MD) at 42° or 50 °C, and with 
random-primers or a specific primer. After 15 PCR cycles the longer product was detectable; 
however, the smaller product (i.e., corresponding to the deletion) was not detectable even 
after 30 or more cycles. This indicates that the RT-PCR product is not an artifact. 

EXAMPLE 10 
SEQUENCING OF TESTIS hTRT mRNA 

The sequence of the testis form of hTRT RNA was determined by dir ect 
manual sequencing of DNA fragments generated by PCR from testis cDNA (Marathon Testes 
cDNA, Clontech, San Diego CA) using a ThermoSequenase radiolabeled terminator cycle 
sequencing kit (Amersham Life Science). The PCR step was performed by a nested PCR, as 
shown in Table 8. In all cases a negative control reaction with primers but no cDNA was 
performed. The absence of product in the control reaction demonstrated that the products 
derived from the reaction with cDNA present were not due to contamination of hTRT from 
pGRN121 or other cell sources (e.g., 293 cells). The DNA fragments were excised from 
agarose gels to purify the DNA prior to sequencing. 

The testis mRNA sequence corresponding to bases 27 to 3553 of the 
pGRN121 insert sequence, and containing the entire hTRT ORF (bases 56 to 3451) was 
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obtained. There were no differences between the testis and the pGRN121 sequences in this 
region. 
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EXAMPLE 11 

DETECTION OF hTRT mRNA BY RNASE PROTECTION 

RNase protection assays can be used to detect, monitor, or diagnose the 
presence of an hTRT mRNA or variant mRNA. One illustrative RNAse protection probe is an 

5 in vitro synthesized RNA comprised of sequences complementary to hTRT mRNA sequences 
and additional, non-complementary sequences. The latter sequences are included to 
distinguish the full-length probe from the fragment of the probe that results from a positive 
result in the assay: in a positive assay, the complementary sequences of the probe are 
protected from RNase digestion, because they are hybridized to hTRT mRNA. The 

10 non-complementary sequences are digested away from the probe in the presence of RNase 
and target complementary nucleic acid. 

Two RNAse protection probes are described for illustrative purposes; either 
can be used in the assay. The probes differ in their sequences complementary to hTRT, but 
contain identical non-complementary sequences, in this embodiment, derived from the SV40 

15 late mRNA leader sequence. From 5 T -3', one probe is comprised of 33 nucleotides of 

non-complementary sequence and 194 nucleotides of sequence complementary to hTRT 
nucleotides 2513 - 2707 for a full length probe size of 227 nucleotides. From 5'-3\ the second 
probe is comprised of 33 nucleotides of non-complementary sequence and 198 nucleotides of 
sequence complementary to hTRT nucleotides 2837 - 3035 for a full length probe si2:e of 231 

20 nucleotides. To conduct the assay, either probe can be hybridized to RNA, i.e., polyA+ RNA, 
from a test sample, and Tl ribonuclease and RNase A are then added. After digestion, probe 
RNA is purified and analyzed by gel electrophoresis. Detection of a 194 nucleotide fragment 
of the 227 nucleotide probe or a 198 nucleotide fragment of the 231 nucleotide probe is 
indicative of hTRT mRNA in the sample. 

25 The illustrative RNAse protection probes described in this example can be 

generated by in vitro transcription using T7 RNA polymerase. Radioactive or otherwise 
labeled ribonucleotides can be included for synthesis of labeled probes. The templates for the 
in vitro transcription reaction to produce the RNA probes are PCR products. These 
illustrative probes can be synthesized using T7 polymerase following PCR amplification of 

30 pGRN121 DNA using primers that span the corresponding complementary region of the 
hTRT gene or mRNA. In addition, the downstream primer contains T7 RNA polymerase 
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promoter sequences and the non-complementary sequences. 

For generation of the first RNAse protection probe, the PCR product from the 
following primer pair (T701 and reverseOl) is used: 

T701 5'-GGGAGATCT TAATACGACTCACTATAG ATTCA GGCCATGGTG 
CTGCGCCGGC TGTCA GGCTCCC ACGACGTAGT CCATGTTCAC-3'; and reverseOl 
5 '-GGGTCT AG AT CCGGAAGAGTGT CTGGAGCAAG-3'. 

For generation of the second RNase protection probe, the PCR product from 
the following primer pair (T702 and reverse02) is used: 

T702 5-GGGAGATCT TAATACGACTCACTATAG ATTCA GGCCATGGTG 
CTGCGCCGGC TGTCA GGGCG GCCTTCTGGA CCACGGCATA CC-3'; and reverse02 
5'-G GTCTAGA CGATATCC ACAGGGCCTG GCGC-3'. 

EXAMPLE 12 

CONSTRT TCTTON OF A PHYLOGENETIC TREE COMPARING hTRT AND 
OTHER REVERSE TRANSCRIPTASES 

A phylogenetic tree (Figure 6) was constructed by comparison of the seven RT 
domains defined by Xiong and Eickbush (1 990, EMBO J. 9:3353). After sequence alignment 
of motifs 1, 2, and A-E from 4 TRTs, 67 RTs, and 3 RNA polymerases, the tree was 
constructed using the NJ (Neighbor Joining) method (Saitou and Nei, 1987, Mol. Biol. Evol. 
4:406). Elements from the same class that are located on the same branch of the tree are 
simplified as a box. The length of each box corresponds to the most divergent element within 
that box. 

The TRTs appear to be more closely related to RTs associated with msDNA, 
group II introns, and non-LTR (Long Terminal Repeat) retrotransposons than to the LTR- 
retrotransposon and viral RTs. The relationship of the telomerase RTs to the non-LTR 
branch of retroelements is intriguing, given that these latter elements have replaced 
telomerase for telomere maintenance in Drosophila. However, the most striking finding is 
that the TRTs form a discrete subgroup, almost as closely related to the RNA-dependent 
RNA polymerases of plus-stranded RNA viruses such as polio virus as to any of the 
previously known RTs. Considering that the four telomerase genes come from evolutionarily 
distant organisms - protozoan, fungi, and mammal ~ this separate grouping cannot be 
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explained by lack of phylogenetic diversity in the data set. Instead, this deep bifurcation 
suggests that the telomerase RTs are an ancient group, perhaps originating with the first 
eukaryote. 

GenBank protein identification or accession numbers used in the phylogenetic 
5 analysis were: msDNAs (94535, 134069, 134074,134075, 134078), group II introns (483039, 
101880, 1332208, 1334433, 1334435, 133345, 1353081), mitochondrial plasmid/RTL 
(903835, 134084), non-LTR retrotransposons (140023, 84806, 103221, 103353, 134083, 
435415, 103015, 1335673, 85020, 141475, 106903, 130402, U0551, 903695, 940390, 
2055276, L08889), LTR retrotransposons (74599, 85105, 130582, 99712, 83589, 84126, 
10 479443, 224319, 130398, 130583, 1335652, 173088, 226407, 101042, 1078824), 

hepadnaviruses (1 18876, 1706510, 118894), caulimoviruses (331554, 130600, 130593, 
93553), retroviruses (130601, 325465, 74601, 130587, 130671, 130607, 130629, 
130589,130631, 1346746, 130651, 130635, 1780973, 130646). Alignment was analyzed 
using ClustalW 1.5 [J. D. Thompson, D. G. Higgins, T. J. Gibson, Nucleic Acids Res. 22, 
15 4673 (1994)] and PHYLIP 3.5 [J. Felsenstein, Cladisfics 5, 164 (1989)]. 

EXAMPLE 13 

TR ANSFFCTTON OF CITT/TTIRED WTTMAN FIB ROBLASTS (BJ> WITH CONTROL 
PTASMTD AND PLASMID F.NCODTNG hTRT 

20 This example demonstrates that expression of recombinant hTRT protein in a 

mammalian cell results in the generation of an active telomerase. 

Subconfluent BJ fibroblasts were trypsinized and resuspended in fresh 
medium (DMEM/199 containing 10% Fetal Calf Serum) at a concentration of 4 x 10 6 
cells/ml. The cells were transfected using electroporation with the BioRad Gene Pulser™ 

25 electroporator. Optionally, one may also transfect cells using Superfect™ reagent (Qiagen) in 
accordance with the manufacturer's instructions. For electroporation, 500 ul of the cell 
suspension were placed in an electroporation cuvette (BioRad, 0.4 cm electrode gap). Plasmid 
DNA (2 ug) was added to the cuvettes and the suspension was gently mixed and incubated on 
ice for 5 minutes. The control plasmid (pBBS212) contained no insert behind the MPSV 

30 promoter and the experimental plasmid (pGRN133) expressed hTRT from the MPSV 

promoter. The cells were electroporated at 300 Volts and 960 uFD. After the pulse was 
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delivered, the cuvettes were placed on ice for approximately 5 minutes prior to plating on 100 
mm tissue culture dishes in medium. After 6 hours, the medium was replaced with fresh 
medium. 72 hours after the transfection, the cells were trypsinized, washed once with PBS, 
pelleted and stored frozen at -80°C. Cell extracts were prepared at a concentration of 25,000 
5 cells/(il by a modified detergent lysis method (see Bodnar et aL, 1996, Exp. Cell Res. 228:58; 
Kim et aL, 1994, Science 266:201 1, and as described in patents and publications relating to 
the TRAP assay, supra) and telomerase activity in the cell extracts was determined using a 
modified PCR-based TRAP assay (Kim et al., 1994, Bodnar et aL, 1996). Briefly, 5 xlO 4 cell 
equivalents were used in the telomerase primer extension portion of the reaction. While the 

1 0 extract is typically taken directly from the telomerase extension reaction to the PCR 

amplification, one may also extract once with phenol/chloroform and once with chloroform 
prior to the PCR amplification. One-fifth of the material was used in the PCR amplification 
portion of the TRAP reaction (approximately 10,000 cell equivalents). One half of the TRAP 
reaction was loaded onto the gel for analysis, such that each lane in Figure 25 represents 

15 reaction products from 5,000 cell equivalents. Extracts from cells transfected with 
pGRN133 were positive for telomerase activity while extracts from untransfected (not 
shown) or control plasmid transfected cells showed no telomerase activity. Similar 
experiments using RPE cells gave the same result. 

Reconstitution in B J cells was also carried out using other hTRT constructs 

20 (i.e., pGRN145, pGRN155 and pGRN138). Reconstitution using these constructs appeared 
to result in more telomerase activity than in the pGRN133 transfected cells. 

The highest level of telomerase activity was achieved using pGRN155. As 
discussed supra, pGRN155 is a vector containing the adenovirus major late promoter as a 
controlling element for the expression of hTRT and was shown to reconstitute telomerase 

25 activity when transfected into B J cells. 

Notably, when reconstitution using the hTRT-GFP fusion protein pGRN138 
(which localizes to the nucleus, see Example 1 5, infra) was performed either in vitro (see 
Example 7) or in vivo (transfection into B J cells) telomerase activity resulted. By transfection 
into B J cells, for example, as described supra, telomerase activity was comparable to that 

30 resulting from reconstitution in vitro using pGRN133 or pGRN145. 

Similar results were obtained upon transfection of normal human retinal 
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pigmented epithelial (RPE) with the hTRT expression vectors of the invention. The 
senescence of RPE cells is believed to contribute to or cause the disease of age-related 
macular degeneration. RPE cells treated in accordance with the methods of the invention 
using the hTRT expression vectors of the invention should exhibit delayed senescence, as 
5 compared to untreated cells, and so be useful in transplantation therapies to treat or prevent 
age-related macular degeneration. 

EXAMPLE 14 
PROMOTER REPORTER CONSTRUCT 

10 This example describes the construction of plasmids in which reporter genes 

are operably linked to hTRT upstream sequences containing promoter elements. The vectors 
have numerous uses, including identification of cis and trans transcriptional regulatory 
factors in vivo and for screening of agents capable of modulating (e.g., activating or 
inhibiting) hTRT expression (e.g., drug screening). Although a number of reporters may be 

15 used (e.g., firefly luciferase, P-glucuronidase, P-galactosidase, chloramphenicol acetyl 
transferase, andGFP and the like), the human secreted alkaline phosphatase (SEAP; 
CloneTech) was used for initial experiments. The SEAP reporter gene encodes a truncated 
form of the placental enzyme which lacks the membrane anchoring domain, thereby allowing 
the protein to be secreted efficiently from transfected cells. Levels of SEAP activity detected 

20 in the culture medium have been shown to be directly proportional to changes in intracellular 
concentrations of SEAP mRNA and protein (Berger et al., 1 988, Gene 66: 1 ; Cullen et al., 
1992, Meth Enzymol 216:362). 

Four constructs (pGRN148, pGRN150, "pSEAP2 basic" (no promoter 
sequences = negative control) and t! pSEAP2 control" (contains the SV40 early promoter and 

25 enhancer) were transfected in triplicate into mortal and immortal cells. 

Plasmid pGRN148 was constructed as illustrated in Figure 9. Briefly, a Bgl2- 
Eco47III fragment from pGRN144 was digested and cloned into the Bglll-Nrul site of 
pSeap2Basic (Clontech, San Diego, CA). A second reporter-promoter, plasmid pGRN150, 
includes sequences from the hTRT intron described in Example 3, to employ regulatory 

30 sequences that may be present in the intron. The initiating Met is mutated to Leu, so that the 
second ATG following the promoter region will be the initiating ATG of the SEAP ORF. 
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The pGRN148 and pGRN150 constructs (which include the hTRT promoter) 



were transfected into mortal (BJ cells) and immortal (293) cells. All transfections were done 
in parallel with two control plasmids: one negative control plasmid (pSEAP basic) and one 
positive control plasmid (pSEAP control which contains the SV40 early promoter and the 

5 SV40 enhancer). 

In immortal cells, pGRN148 and pGRN150 constructs appear to drive SEAP 
expression as efficiently as the pSEAP2 positive control (containing the S V40 early promoter 
and enhancer). In contrast, in mortal cells only the pSEAP2 control gave detectable activity. 
These results indicate that, as expected, hTRT promoter sequences are active in tumor cells 

1 0 but not in mortal cells. 

Similar results were obtained using another normal cell line (RPE, or retinal 
pigmental epithelial cells). In RPE cells transfected with pGRN150 (containing 2.2 KB of 
upstream genomic sequence), the hTRT promoter region was inactive while the pSEAP2 
control plasmid was active. 

15 As noted supra, plasmids in which reporter genes are operably linked to hTRT 

upstream sequences containing promoter elements are extremely useful for identification and 
screening of telomerase activity modulatory agents, using both transient and stable 
transfection techniques. In one approach, for example, stable transformants of pGRN148 are 
made in telomerase negative and telomerase positive cells by cotransfection with a eukaryotic 

20 selectable marker (such as neo) according to Ausubel et al., 1997, supra. The resulting cell 
lines are used for screening of putative telomerase modulatory agents, for example, by 
comparing hTRT-promoter-driven expression in the presence and absence of a test 
compound. 



25 identify trans- and cis-acting transcriptional and translational regulatory elements. Examples 
of cis-acting transcriptional regulatory elements include promoters and enhancers of the 
telomerase gene. The identification and isolation of cis- and trans- acting regulatory agents 
provide for further methods and reagents for identifying agents that modulate transcription 
and translation of telomerase. 

30 To identify sequences or elements that play a role in hTRT expression, 

expression was tested using promoter-reporter constructs with varying amounts of the 



The promoter-reporter (and other) vectors of the invention are also used to 
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upstream region (5 ! to the transcription initiation site) of the hTRT gene. Experiments were 
conducted using pGRN 150 [which contains approximately 2405 bp of genomic sequence 
upstream of the most 5 ! nucleotide present in the hTRT cDNA], pGRN 176 [which contains 
approximately 186 bp of genomic sequence upstream of the most 5' nucleotide present in the 

5 hTRT cDNA] and pGRN 175 [which contains approximately 77 bp of genomic sequence 
upstream of the most 5* nucleotide present in the hTRT cDNA]. The following sequence is 
present in pGRN 176 but not pGRN 175: 5-GTGGCGGAGGGACTGGGGACCCGGGC 
ACCGGTCCTGCCCCTTCACCTTCCAGCTCCGCCTCGTCCGCGCGGAACCCCGCCC 
CGTCCCGAACCCTTCCCGGGTCCCCGGCCCAGCCCCTTCCGGG-S'. 

10 When transfected into mortal cells (RPE and BJ), the pGRN 175 promoter was 

active, while the pGRN 176 and pGRN 150 promoters were not active. These results 
demonstrate that the approximately 120 basepair region present in pGRN 176 but not pGRN 
175 includes sequences that play a role in the mortal-cell specific repression of hTRT gene 
expression is achieved. It will be recognized that less than the entire approximately 120 

1 5 basepair sequence may be required for this effect, and that other sequences not in the 

approximately 120 base pair region may also play a role (independently or in combination 
with the approximately 120 base pair region) in regulation of hTRT expression. Thus, the 
approximately 120 base pair region includes all or part of one or more cis-acting elements. 
Without intending to be bound by any particular mechanism, the 

20 approximately 120 base pair sequence includes a binding site for a repressor (e.g., a trans 
acting repressor) which upon binding prevents initiation of transcription of the hTRT gene. 
Such a repressor may be the product of an anti-oncogene (e.g., a novel anti-oncogene), which 
can be identified and cloned in accordance with the teachings herein and the use of the novel 
reagents disclosed herein. In normal cells, repressor binding or interaction with hTRT 

25 regulatory sequences (e.g., including or within the approximately 120 base pair sequence) 
results in the absence of hTRT protein and therefore of telomerase activity. Activation of 
telomerase in cancer cells can result from the loss of hTRT repressor activity. 

A number of applications of the "approximately 120 base pair region" 
described above will be immediately apparent upon review of this disclosure, including for 

30 treatment or diagnosis of telomerase related diseases and identification of agents with 

telomerase modulatory activity. For example, using standard techniques, the sequence may 
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be used to identify agents or proteins (e.g. naturally occurring repressor proteins) that 
specifically bind to the approximately 120 base pair sequence or a subsequence thereof. In 
addition, synthetic or naturally occurring agents that increase or stabilize repression (e.g., by 
binding or otherwise interacting with the sequence, by stabilizing binding by a naturally 
5 occurring repressor, or by other means) will be useful for reducing telomerase activity in a 
cell (e.g., for treatment of malignancy). Similarly, agents that reduce repression (e.g., by 
inhibiting repressor binding, or by other means) will be useful for increasing telomerase 
expression (e.g., by controlled activation), for example to increase the proliferative capacity 
of normal cells). 

10 

EXAMPLE 15 
SUBCELLULAR LOCALIZATION OF hTRT 

A fusion protein having hTRT and enhanced green fluorescent protein (EGFP; 
Cormack et al., 1996, Gene 173:33) regions was constructed as described below. The EGFP 
15 moiety provides a detectable tag or signal so that the presence or location of the fusion 
protein can be easily determined. Because EGFP-fusion proteins localize in the correct 
cellular compartments, this construct may be used to determine the subcellular location of 
hTRT protein. 

20 A. Construction of pGRN138 

A vector for expression of an hTRT-EGFP fusion protein in mammalian cells 
was constructed by placing the EcoRI insert from pGRN124 (see Example 6) into the EcoRI 
site of pEGFP-C2 (Clontech, San Diego, CA). The amino acid sequence of the fusion protein 
is provided below. EGFP residues are in bold, residues encoded by the 5' untranslated region 
25 of hTRT mRNA are underlined, and the hTRT protein sequence is in normal font. 

MVSKGEELFTGVVP ILVELDGDVNGHKFS VSGEGEGD ATYGKLTLKF I CTTGKLP VPWPT 
LVTTLTYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTL 
VNRIELKGIDFKEDGNILGHKLEYNYNSH1TVYIMADKQKNGIKVNFKIRHNIEDGSVQIA 
DHYQQNTPIGDGPVLLPDNHYIaSTQSALSKDPNEKRDHMVLLEFVTAAGITLGMDEIiYKS 

30 GRTQTSSS5FEFAAAST ORCVLLRTWEAIAPATPAM PRAPRCRAVRSLLRSHYREVLPIA 
TFVRRIjGPQGWRLVQRGDPAAFRALVAQCIjVCVPWDARPPPAAPS FRQVS CLKELVARVL 
QRLCERGAKNVLAFGFALLDGARGGPPEAFTTSVRSYLPNTVTDALRGSGAWGLLLRRVG 
DDVL VHLLARCALFVL VAP S CAYQVCGP PL YQLGAATQARP P PHAS GPRRRLGCERAWNH 
SVREAGVPLGLPAPGARRRGGSASRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRGP 

35 SDRGFCWSPARPAEEATSLEGALSGTRHSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVY 
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AETKHFLYSSGDKEQLRPSFLLSSLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQR 
YWQMRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDP 
RRLVQLLRQHS S PWQVYGFVRACLRRLVPPGLWGSRHNERRFLRNTKKFI SLGKHAKLSL 
QELTWKMSVRDCAWLRRS PGVGCVPAAEHRLREE I LAKFLHWLMS VYWELLRS FFYVTE 
5 TTFQKNRLFFYRPSVWSKLQS IGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFI P 
KPDGLRPIVl^DYWGARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGASVLGLDD 
IHRAWRTFVLRVRAQDPPPELYFVKVDVTGAYDTI PQDRLTEVIASI IKPQNTYCVRRYA 
WQKAAHGHVRKAFKSHVSTLTDLQP YMRQFVAHLQETS PLRDAWI EQS S SLNE AS SGL 
FDVFLRFMCHHAVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLR 
1 0 LVDDFLLVTPHLTHAKTFLRTLWGVPEYGCVVNLRKTVVNFPVEDEALGGTAFVQMPAH 
GLFPWCGLLLDTRTLEVQSDYSSYARTSIRASVTFNRGFKAGRl^RRKLFGVLRLKCHSL 
FLDLQVNSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKNPTFFLRVISDTASLCYS 
I LKAKNAGMS LGAKGAAGPL P S E AVQWLCHQAFLLKLTRHRVTYVPLLGS LRTAQTQLS R 
KLPGTTLTALEAAANPALPSDFKTILD 

15 

Other EGFP fusion constructs can be made using partial (e.g., truncated) hTRT coding 
sequence and used, as described infra, to identify activities of particular regions of the hTRT 
polypeptide. 



20 B. Nuclear Localization and Uses of pGRN138 

Transfection of NIH 293 and BJ cells with pGRN138 confirmed the nuclear 
localization of recombinantly expressed hTRT. Cells were transfected with pGRN138 
(EGFP-hTRT) and with a control construct (expressing EGFP only). Nuclear localization of 
the EGFP-hTRT is apparent in both cell types by fluorescence microscopy. As noted supra, 

25 the pGRN138 hTRT-GFP fusion protein supports reconstitution of telomerase activity in both 
an in vitro transcription translation system and in vivo when transfected into BJ cells. 

The hTRT-EGFP fusion proteins (or similar detectable fusion proteins) can be 
used in a variety of applications. For example, the fusion construct described in this example, 
or a construct of EGFP and a truncated form of hTRT, can be used to assess the ability of 

30 hTRT and variants to enter a cell nucleus and/or localize at the chromosome ends. In 
addition, cells stably or transiently transfected with pGRN138 are used for screening 
compounds to identify telomerase modulatory drugs or compounds. Agents that interfere 
with nuclear localization or telomere localization can be identified as telomerase inhibitors. 
Tumor cell lines stably expressing EGFP-hTRT can be useful for this purpose. Potential 

35 modulators of telomerase will be administered to these transfected cells and the localization 
of the EGFP-hTRT will be assessed. In addition, FACS or other fluorescence-based methods 
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can be used to select cells expressing hTRT to provide homogeneous populations for drug 
screening, particularly when transient transfection of cells is employed. 

In other applications, regions of the hTRT can be mutagenized to identify 
regions (e.g., residues 193-196 (PRRR) and 235-240 (PKRPRR)) required for nuclear 
5 localization, which are targets for anti-telomerase drugs (telomerase activity modulators). 
Other applications include: 

use of the fusion protein as a fluorescent marker of efficient cell transfection 
for both transient transfection experiments and when establishing stable cell lines expressing 
EGFP-hTRT; 

I o expression of an hTRT-EGFP fusion with mutated nuclear localization signals 

(deficient for nuclear localization) in immortal cells so that the hTRT mutant-EGFP 
scavenges all the hTR of the immortal cells, retaining it in the cytoplasm and preventing 
telomere maintenance; and 

use as a tagged protein for immunoprecipitation. 

15 

EXAMPLE 16 

EFFECT OF MUTATION ON TELOMERASE CATA LYTIC ACTIVITY 

This example describes hTRT variant proteins having altered amino acids and 

altered telomerase catalytic activity. Amino acid substitutions followed by functional 
20 analysis is a standard means of assessing the importance and function of a polypeptide 

sequence. This example demonstrates that changes in the reverse transcriptase (RT) and 

telomerase (T) motifs affect telomerase catalytic activity. 

Conventional nomenclature is used to describe mutants: the target residue in 

the native molecule (hTRT) is identified by one-letter code and position, and the 
25 corresponding residue in the mutant protein is indicated by one-letter code. Thus, for 

example, "K626A" specifies a mutant in which the lysine at position 626 (i.e., in motif 1) of 

hTRT is changed to an alanine. 

A. Mutation of hTRT FFYxTE Motif 

30 In initial experiments, a vector encoding an hTRT mutant protein, "F560A," 

was produced in which amino acid 560 of hTRT was changed from phenylalanine (F) to 
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alanine (A) by site directed mutagenesis of pGRN121 using standard techniques. This 
mutation disrupts the TRT FFYxTE motif. The resulting F560A mutant polynucleotide was 
shown to direct synthesis of a full length hTRT protein as assessed using a cell-free 
reticulocyte lysate transcription/translation system in the presence of 35 S-methionine, 
5 When the mutant polypeptide was co-translated with hTR, as described in 

Example 7, no telomerase activity was detected as observed by TRAP using 20 cycles of 
PCR, while a control hTRT/hTR cotranslation did reconstitute activity. With 30 cycles of 
PCR in the TRAP assay, telomerase activity was observable with the mutant hTRT, but was 
considerably lower than the control (wild-type) hTRT. 

10 

B. Additional Site-Directed Mutagenesis of hTRT Amino Acid Residues 

Conserved amino acids in six RT motifs were changed to alanine using 
standard site directed mutagenesis techniques (see, e.g., Ausubel, supra) to assess their 
contribution to catalytic activity. The mutants were assayed using IVR telomerase using the 

15 two step conventional/TRAP assay detailed in example 7. 

The K626A (motif 1), R631A (motif 2), D712A (motif A), Y717A (motif A), 
D868A (motif C) mutants had greatly reduced or undetectable telomerase activity (<1% of 
wild-type), while the Q833A (motif B) and G932A (motif E) mutants exhibited 
low/intermediate levels of activity (<10% of wild-type). Two mutations outside the RT 

20 motifs, R688A and D897A, had activity equivalent to wild type hTRT. These results were 
consistent with analogous mutations in reverse transcriptases (Joyce et al., 1994, Ann. Rev. 
Biochem. 63:777) and are similar to results obtained with Est2p (see Lingner, 1997, Science 
276:561). The experiments identify residues in the RT motifs critical and not critical for 
enzymatic activity and demonstrate that hTRT is the catalytic protein of human telomerase. 

25 The mutations provide variant hTRT polypeptides that have utility, e.g., as dominant/negative 
regulators of telomerase activity. 

Amino acid alignment of the known TRTs identified a telomerase-specific 
motif, motif T (see supra). To determine the catalytic role of this motif in hTRT, a six amino 
acid deletion in this motif (A560-565; FFYxTE), was constructed using standard site directed 

30 mutagenesis techniques (Ausubel, supra). The deletion was assayed using IVR telomerase 
using the two step conventional/TRAP assay detailed in Example 7. The A560-565 mutant 
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had no observable telomerase activity after 25 cycles of PCR whereas wild type hTRT IVR 
telomerase produced a strong signal. Each amino acid in each residue in motif T was 
examined independently in a similar manner; mutants F560A, Y562A, T564A, and E565A 
retained intermediate levels of telomerase activity, while a control mutant, F487A, had 

5 minimal affect on activity. Notably, mutant F561A had greatly reduced or undetectable 
telomerase activity, while activity was fully restored in its "revertant", F561 A561F. 
F561A561Fchanges the mutated position back to its original phenylalanine. This is a control 
that demonstrates that no other amino acid changes occurred to the plasmid that could 
account for the decreased activity observed. Thus, the T motif is the first non-RT motif 

1 0 shown to be absolutely required for telomerase activity. 

Motif T can be used for identification of TRTs from other organisms and 
hTRT proteins comprising variants of this motif can be used as a dominant/negative regulator 
of telomerase activity. Unlike most other RTs, telomerase stably associates with and 
processively copies a small portion of a single RNA (ie. hTR), thus motif T can be involved 

1 5 in mediating hTR binding, the processivity of the reaction, or other functions unique to the 
telomerase RT. 

In other experiments, it was observed that the deletion variant encoded by 
pro90hTRT described herein, did not reconstitute telomerase activity when co-synthesized 
with hTR, as measured using a modified TRAP assay (Autexier et al. ? 1996, EMBO Journal 
20 15:5928, which is incorporated herein by reference). 

Example 17 

SCREENING FOR TELOMERASE ACTIVITY MODULATORS USIN G 
RECOMBINANTLY EXPRESSED TELOMERASE COMPONENTS 

25 This example describes the use of in vitro reconstituted telomerase for 

screening and identifying telomerase activity modulators. The assay described is easily 
adapted to high-through-put methods (e.g., using multiple well plates and/or robotic systems). 
Numerous variations on the steps of the assay will be apparent to one of skill in the art after 
review of this disclosure. 

30 Recombinant clones for telomerase components (e.g., hTRT and hTR) are 

transcribed and translated (hTRT only) in an in vitro reaction as follows and as described in 
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Example 7 supra, using the TNT® T7 Coupled Reticulocyte lysate system (Promega), which 
is described in U.S. Patent No. 5,324,637, following the manufacturer's instructions: 

Reagent Amount per reaction ( \xL) 

5 TNT Rabbit Reticulocyte lysate 25 

TNT reaction buffer 2 

TNT T7 RNA Pol. 1 

AA mixture (complete) 1 

Prime RNase inhibitor 1 

1 0 Nuclease-free water 1 6 

Xbal cut pGRN121 [ hTRT] (0.5 \xg) 2 

Fspl cut pGRN164 [hTR] (0.5 Hg) 2 



15 The reaction is incubated at 30°C for 2 hours. The product is then purified on an 
ultrafree-MC DEAE filter (Millipore). 

The recombinant telomerase product (IVRP) is assayed in the presence and 
absence of multiple concentrations of test compounds which are solubilized in DMSO (e.g. 
10 jiM - 100 jiM). Test compounds are preincubated in a total volume of 25 \iL for 30 

20 minutes at room temperature in the presence of 2.5 (aL IVRP, 2.5% DMSO, and IX TRAP 
Buffer (20 mM Tris-HCl, pH 8.3, 1.5mM MgCl 2 , 63 mM KC1, 0.05%Tween20, 1.0 mM 
EGTA, 0.1 mg/ml Bovine serum albumin). Following the preincubation, 25 |aL of the TRAP 
assay reaction mixture is added to each sample. The TRAP assay reaction mixture is 
composed of IX TRAP buffer, 50fiL dNTP, 2.0 [ig/ml primer ACX, 4 ^g/ml primer U2, 0.8 

25 attomol/ml TSU2, 2 units/5 0\il Taq polymerase (Perkin Elmer), and 2 jxg/ml 

[ 32 P]5 f end-labeled primer TS (3000Ci/mmol). The reaction tubes are then placed in the PCR 
thermocycler (MJ Research) and PCR is performed as follows: 60 min at 30°C, 20 cycles of 
{30 sec at 94°C, 30 sec, at 60°C, 30 sec. at 72°C}, 1 min at 72°C, cool down to 10°C. The 
TRAP assay is described, as noted supra, in U.S. Patent No. 5,629,154. The primers and 

30 substrate used have the sequences: TS Primer (5'-AATCCGTCGAGCAGAGTT-3'); ACX 
Primer (5'-GCGCGG[CTTACC]3CTAACC-3'); U2 primer 



HUH " 




(5^ATCGCTTCTCGGCCTTTT-3*); TSU2 

(5'-AATCCGTCGAGCAGAGTTAAAAGGCCGAGAAGCGAT-3') 

After completion of the PCR step, 4 ul of 10X loading buffer containing 



bromophenol blue is added to each reaction tube and products (20 al) are run on a 12.5% 
5 non-denaturing PAGE in 0.5X TBE at 400 V. The completed gel is subsequently dried and 
the TRAP products are visualized by Phosphorimager or by autoradiography. The telomerase 
activity in the presence of the test compound is measured by comparing the incorporation of 
label in reaction product to a parallel reaction lacking the agent. 

The following clones described in the Examples have been deposited with the 
American Type Culture Collection, Rockville, MD 20852, USA: 
Lambda phage 25-1.1 ATCC accession number 209024 



and diagnosis and treatment of telomerase-related diseases. While specific examples have 
20 been provided, the above description is illustrative and not restrictive. Many variations of the 

invention will become apparent to those of skill in the art upon review of this specification. 

The scope of the invention should, therefore, be determined not with reference to the above 

description, but instead should be determined with reference to the appended claims along 

with their full scope of equivalents. 
25 All publications and patent documents cited in this application are 

incorporated by reference in their entirety for all purposes to the same extent as if each 

individual publication or patent document were so individually denoted. 
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pGRN121 ATCC accession number 209016 

Lambda phage XG&5 ATCC accession number 98505 



The present invention provides novel methods and materials relating to hTRT 
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WHAT IS CLAIMED IS: 



1 . An isolated, substantially pure, or recombinant protein preparation of 
a human telomerase reverse transcriptase (hTRT) protein, or a variant thereof, or a fragment 

5 thereof. 

2. An isolated, synthetic, substantially pure, or recombinant 
polynucleotide that is at least ten nucleotides to 3kb in length and comprises a contiguous 
sequence of at least ten nucleotides that is identical or exactly complementary to a contiguous 

10 sequence encoding a recombinant protein of claim 1 . 

3. The polynucleotide of claim 2 that encodes an hTRT protein or 

fragment 

15 4. A method of identifying a compound that modulates hTRT activity, 

said method comprising the steps of contacting an hTRT protein of claim 1 with said 
compound and measuring a change in a property or activity of said hTRT, wherein a 
statistically significant change in said property or activity identifies said compound as a 
modulator of hTRT activity. 

20 

5. The method of claim 4 wherein the compound is an inhibitor of hTRT 

activity. 

6. A method of preparing recombinant telomerase, said method 
25 comprising contacting a recombinant hTRT protein of claim 1 with a telomerase RN A 

component under conditions such that said recombinant protein and said telomerase RNA 
component associate to form a telomerase enzyme capable of catalyzing the addition of 
nucleotides to a telomerase substrate, 

30 7. The method of claim 6, wherein the hTRT protein has a sequence of 

Figure 17. 
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8. The method of claim 7, wherein the hTRT protein is produced in an in 
vitro expression system. 

9. The method of claim 6, wherein a said hTRRT protein is substantially 
5 purified before said contacting. 

10. A method for increasing the proliferative capacity of a vertebrate cell 
by introducing a recombinant hTRT polynucleotide of claim 3 into the cell, and wherein said 
sequence is operably linked to a promoter. 

10 

11. A method of detecting the presence of at least one telomerase positive 
human cell in a biological sample comprising human cells, said method comprising the steps: 

a) measuring the amount of an hTRT gene product in said 

sample, 

15 b) comparing the amount measured with a control correlating 

to a sample lacking telomerase positive cells, 

wherein the presence of a higher level of the hTRT gene product in said 
sample as compared to said control is correlated with the presence of telomerase positive cells 
in the biological sample. 

20 

12. The method of claim 1 1 , wherein said telomerase positive cells are 

cancer cells. 

13. The method of claim 1 1, wherein the amount of an hTRT gene 
25 product is measured using an antibody. 

1 4. The method of claim 1 1 , wherein the amount of an hTRT gene 
product is measured using a nucleotide probe. 

30 
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15. The method of claim 11, wherein said detecting involves diagnosing a 
telomerase-related condition in a patient, and said method further comprises the steps of: 

a) obtaining a cell or tissue sample from the patient; 

b) measuring the amount of an hTRT gene product in the cell 

5 or tissue; and, 

c) comparing the amount of hTRT gene product in the cell or 
tissue with the amount in a healthy cell or tissue of the same type; 

wherein a different amount of hTRT gene product in the sample from 
the patient and the healthy cell or tissue is diagnostic of a telomerase-related condition. 

10 

16. The method of claim 1 5 wherein the amount is higher in said sample 
than in said healthy cell or tissue and said telomerase-related condition is cancer. 

17. A method for treatment of a condition associated with an elevated 
15 level of telomerase activity within a cell, comprising introducing into said cell a 

therapeutically effective amount of an inhibitor of said telomerase activity, wherein said 
inhibitor is an hTRT polypeptide, an antibody that binds hTRT, or an hTRT polynucleotide. 

18. The method of claim 1 7, wherein the inhibitor is an oligonucleotide 
20 comprising the sequence of Figure 17 or a subsequence or variant thereof. 

1 9. The method of claim 1 8, wherein the oligonucleotide comprises 
nonstandard or derivatized bases or linkages between bases. 

25 20. The method of claim 1 7, wherein the inhibitor is a polynucleotide that 

inhibits binding of endogenous hTRT to hTR. 
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ABSTRACT 



The invention provides compositions and methods related to human telomerase reverse 
transcriptase (hTRT), the catalytic protein subunit of human telomerase. The polynucleotides 
and polypeptides of the invention are useful for diagnosis, prognosis and treatment of human 
diseases, for changing the proliferative capacity of cells and organisms, and for identification and 
screening of compounds and treatments useful for treatment of diseases such as cancers. 
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Figure 1 
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Seq. ID. No 7 



1 GGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGC 
CCTGGGCCGCCGAAAGGCGCGCGACCACCGGGTCACGGACCACACGCACGGGACCCTGCG 



NFkB_CSl 
GGGRQTYYQC 
NFkB-MHC-I . 2 
TGGGCTTCCCC 



1 ACGGCCGCCCCCCGCCGCCCCCTCCTTCCGCCAGGTGGGCCTCCCCGGGGTCGGCGTCCG 
TGCCGGCGGGGGGCGGCGGGGGAGGAAGGCGGTCCACCCGGAGGGGCCCCAGCCGCAGGC 

Intronl 



1 GCTGGGGTTGAGGGCGGCCGGGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTC 
CGACCCCAACTCCCGCCGGCCCCCCTTGGTCGCTGTACGCCTCTCGTCGCGTCCGCTGAG 

NFkB_CSl 
GGGRQTYYQC 
NFkB_CS2 
RGGGRMTYYCC 

Topo_ I I_c 1 eavage_s ice 
RNYNNCNNGYNGKTNYNY 



1 AGGGCGCTTCCCCCGCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGG 
TCCCGCGAAGGGGGCGTCCACAGGACGGACTTCCTCGACCACCGGGCTCACGACGTCTCC 
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1 AAAACCCCAA AACCCCAAAA ccccttttag AGCCCTGCAG TTGGAAATAT 

5 i AACCTCAGTA TTAATAAGCT CaGATTTTAA ATATTAATTA CAAAACC7AA 

! 0 1 ATGGAGGTTG ATGTTGATAA TCAAGCTGAT AATCATGGCA TTCACTCAGC 

: 5 I TCTTAAGACT TGTGAAGAAA TTAAAGAAGC TAAAACGT7G TACTCTTGGA 

20 1 TCCAGAAAGT TATTAGATGA AG AAATCAAT CTCAAAGTCA TTATAAAGAT 

25 1 TTAGAAGATA TTAAAATATT TGCGCAGACA AATATTG7TG CTACTCCACG 

301 AGAC7ATAAT GAAGAAGATT TTAA AGTTAT TGCAAGAAAA GAAGTATTTT 

3 5 i C AACTGGACT AATGATCG AA CTTATTG AC A AATGCTTAGT TGAACTTCTT 

JO 1 TCATCAAGCG ATGTTTCAGA TAGACAAAAA CTTCAATGAT TTGGATTTCA 

J 5 I ACTTAAGGGA A ATCAATTAG C AA AGACCCA TTTATTAACA GCTCTT7CAA 

50 1 CTCAAAAGCA GTATTTCTTT CAAGACGAAT GGAACCAAGT TAGAGCAATG 

5 5 I A TTGG AAA TG AGCTCTTCCG ACATCTCTAC ACTAAATATT TAATATTCC A 

60 1 GC GAACTTCT GAAGGAACTC TTGTTCAATT TTGCGGGAAT AACGTTTTTG 

65 I ATCATTTGAA AGTCAACGAT AAGTTTGACA AAAAGCAAAA AGGTCGAGCA 

7 0 1 GCAGACATGA aTGAACCTCC ATGTTGATCA ACCTGCAAAT ACA.ATGTC.AA 

"I GAATGAGAAA.GATCACTTTC TCAACAACAT caacgtgccg aattggaata 

ioi atatgaaatc aagaaccaga atattttatt gcactcattttaatagaaat 

3 5 i AACCAATTCT TC.AAAA.AGCA TG AGTTTGTG AGTAACAAAA ACAATATTTC 

?0 i AGCGATGGAC AGAGCTCAGA CGATATTCAC GAATATATTC AG atttaata 

Q 5 1 GAATTAGAAA G A A G CT AAlAA G A T AA G GTT A TCGAAAAAAT TGCCTACATG 

1 00 1 CTTGAGAAAG TCAAAGATTT TA ACTTCAAC TACTATTTAA CAAAATCTTG 

1051 TCCTCTTCCA GAAAATTGGC CGGAACGGAA ACAAAAAATC GAAAACTTGA 

! tOl TAAATAAAAC TAGAGAAGAA AAGTCGAAGT ACTATGAAGA GCTGTTTAGC 

i 1 5 I TACACAACTG ATAATAAATG CGTCACACAA TTTATTAATG AATTTTTCTA 

\ 20 i C.AATATACTC CCCAAAGACTTTTTGACTGG AAGAAACCGT AAGAATTTTC 

1251 A AAAG AAAGT 7AAGAAATAT GTCGAACTAA ACAAGCATGA ACTCATTCAC 

: 30 1 AAAAACTTAT TGCTTGAGAA GATCAATACA AGAGAAATAT CATGGATGCA 

! 35 I GGTTGAGACC TCTGCAAAGC ATTTTTATTA TTTTGATCAC GAAAACATCT 

. ^0 t ACGTCTTATG GAAATTGCTC CGATGGATAT TCGAGGATCT CGTCGTCTCG 

.451 CTGATTAGAT GATTTTTCTA TG7CACCGAG CAACAGAAAA GTTACTCCAA 

. 50 I AoACCTATTaC TACAGAAAGA ATATTTGGGA CGTCATTATG AAAATGTCAA 

'551 TCGCAGACTT MAGAAGCAA ACGCTTCCTG AGGTCCAAGA AAAAGaGGTT 

■ 60 ! GAAGAATGGA AAAAGTCGCT tggatttgca cctggaaaac tcagactaat 

; 65 1 ACCCAAGAAA actactttcc gtccaattat gactttcaat AAGAAGATTG 

I 70 ! taaattcaga ccggaagact ACAAAATTAA ctacaaatac gaagttattg 

: 75 i aactctcacttaatgcttaa gacattgaag aatagaatgt ttaaagatcc 

i 30 1 ttttggattc gctgttttta actatgatga tgtaatgaaa AAGTATGAGG 

1351 AGTTTGTTTG CAAATGGAAG CAAGTTGGAC AACCAAAACTCTTCTTTGCA 

1 90 1 ACTATGGATA TCGAAAAGTC ATATGATAGT GTAAACAGAG AAAAACTATC 
1951 AACATTCCTA AAAACTACTA A ATTACTTTC TTCAGATTTC TGGATTATGA 
2001 CTGCACAAATTCTAAAGAGA AAGAATAACA TAGTTATCGA TTCGAAAAAC 
205 i TTTAGAAAGA AAGAAATGAA aGATTaTTTT AGACAGAAAT TCCAGAAGAT 

2 1 0 I TGCACTTGAA GCAGGaCAAT ATCCA.aCCTT aTTCAGTGTT CTTG A AAA TG 

2 ! 5 I aaCAAAATGA CTTAAATGCA aaGAAAACaT TAATTGTTGA aGCAAAGCAA 

220 I AGAAATTATT TTAAGAAAGA TaaCTTaCTT caaccagtca ttaatatttg 

2251 CCAATATAATTACAT7AACT TTaaTGGCAA gttttataaa caaacaaaaG 

2301 GAATTCCTCA AGGTCTTTGa GTTTCa TCAA till GTCATC ATTTTATTaT 
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155 ! 
2601 
2651 



2351 GCAACATTAG AGGAAAGC7C CTTAGGATTC CTTAGAGATG AA 7C AA7G AA 
240 ! CCC7 GAAAAT CC.AAATGTTA ATCTTCTAAT GAGACTTACA G ATGACTATC 
-45 1 TTTTGA i » AC A ACTCAAGAG AATAATGCAG TATTGTTTAT 7GAGAAAC77 
2501 ATAAAC GiAA GTCGTGAAAA TGGATTTAAA TTCAATATGA AGAAACTACA 
GACjAGi i i i CCATTAAGTC CAAGCAAATT TGCAAAATAC GGAATGGATA 
GTG i iGAGGA GCAAAATATT GTTCAAGATT ACTGCGATTG GATTGGCATC 
iCAATTGATA 7GAAAAC7C7 7GC777AA7G CCAAATATTA ACTTGAGAAT 
270 1 AGAAGGAATT CTGTGTACAC TCAATCTAAA CATGCAAACA AAGAAAGCAT 

27i I CAATG iCGC i CAAGAAGAAA CTAAAGTCGT ttttaatgaa taacattacc 
2S0t CATTATTTTA GAAAGACGAT TACAACCGAA GACTTTGCGA ATAAAACTCT 
2S5 I CAACAAGTTA TTTATATCAG GCGGTTACAA atacatgcaa tgagccaaag 
2901 AATACAAGGA ccactttaag aagaacttag ctatgagcag TATGATCGAC 
295 1 ^AGAGGj AT C i AAAATTAT atactctgta ACCAGAGCAT TCTTTAAATA 
3001 CC i iGiG ;GC AATATTAAGG ATACAATTTT TGGAGAGGAG CATTATCCAG 
305 i A CTTTTTC CT i AGCACACTG AAGCACTTTA TTG AAA T A TT CAGCAC a AA A 
i ! 0 1 AAG7ACA : i i i CAACAGAGT TTGCATGATC CTCAAGGCAA AAGAAGCAAA 
3 ! 5 ! CC7AAAAAG i GA CCAA 7GTC AATCTC7AAT 7C AA7A7G A T GCATAG7CGA 
- 20 ! C i AT7 C i .-A C T7A 7777G GA A AG77AA 777 7CAA77777G 7C7TATATA<^ 
3 25 I i CGGG t i i i G GGG7777GGG G7T77GGGG 



Figure 14 



1 MEVDVDNQAD NHGIKSALKT CEEIKJEAKTL YSWTQKV1RC RNQSQSHYKD 
5 I LEDIKIFAQT NIVATPRDYN EEDFK.VIARK. EVFSTGLMIE LIDKCLVELL 
10! SSSDVSDRQK LQCFGFQLKG NOLAKTHLLT ALSTQKQYFF QDEWNQVRam 
: 5 1 IGNELFRHLY TKYLIFQRTS EGTLVQFCGN NVFDHLKVND KFDKXQKGGA 
201 ADMNEPRCCS TCKYNVKNESC DHFLNNINVP NWNNMKSRTR IFYCTKFNRN 
25 ! NQFFKKHEFV SNfCNNISAMD RAQTIFTNIF RFNRIRKKLK DKVIEKJAYM 
301 LEKVKDFNFN YYLTXSCPLP ENWRERKQKI ENLINKTREE KSKYYEELFS 
351 YTTDNKCVTQ FINEFFYML PKDFLTGRNR KNFQKKVKKY VELNKHELIH 
■JO 1 KNLLLEKJNT REIS WMQVET SAfCHFYYFDH ENIYVLWKLL RWTFEDLWS 
451 LIRCFFYVTE QQKSYSKTYY YRKNIWDVIM KMSIADLKKE TLAEVQEKEV 
501 EEWKKSLGFA PGrCLRLIPKK TTFRPIMTFN KKIVNSDRKT TKLTTNTKLL 
55 I NSHLMLKTLK NRMFKDPFGF A VFNYDDVMK KYEEFVCKWK QVGQPKLFrA 
501 TMDEEKCYDS VNREKLSTFL KTTKLLSSDF WIMTAQILKR KNN1VIDSKN 
55 1 FRKKEMKDYF RQKFQKJALE GGQYPTLFSV LENEQNDLNA KJCTLIVEAKO 
"01 RNYFKKDNLL QPVrNlCQYN YrNFNGtCFYK QTKGIPQGLC VSSILSSFYY 
"5 1 ATLEESSLGF LRDESMNPEN PNVNLLMRLT DDYLLITTQE NNA VLFIEXLL 

so i [nvsrengf:< fnmkxlotsf pls?s:<faky gmdsveeqni VQDYCDW1G1 
>5i S [DMK TLALM pninlrjegi lctlnlnmqt kkasmwlkkx lksflmnmtt 

?01 ^FrTRKTTTTE DFAMCTLNKL FISGGYKYMQ CAKEYKDHFK KNLAMSSMID 
55 : LEVSKIIYSV TRAFFKYLVC NIKDTIFGEE HYPDFFLSTL khfieifstk 
00! KYTFNRVCMI LKAtCEAKLKS DOCQSLIQYD A 
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1 ggcacc?acciaccticcc::c:tcataaqctaacc5ccrcctC5aac?cccc:aaaccc=cggaaatatc:rtacaaca 80 
31 acccaacaacaacaccaagtcaaacCwcaacacgaaggcgttaccagcgaccgacaacactcciaLCCcaccggccgcca 160 
151 c=aagcacaaggacaaaaagaacaacciccctccccccaaagacttttacct:actaactrac::t:caaacatattccg 240 

2 41 ggctcgcctacttttaaccgcggcaccgtiiragccgcracctctagccaaccgcgcgttrcsaccccgicactggacac 320 

3 21 agcccrrggagragcccacagaaacccrcacaaaccticcrqacgagaccacaccsgacncaccacagcccgcgcacaccc 400 
401 iraacatiggagccrtacaccrragacgagccacgticgcacganggagcaccrggraccacccaacgcccgccccgaaaag 480 

4 81 gccgacaaccactcgcaaaaccacgcccctagcggcggcaacccgcgaaagctrticgacgccsgcacacgccragcacg 560 

5 61 actgagacactcaaaaacttccacccaccacaaccccczraacgcggmracLCCCciantttcratccccacgtcgcc 640 
541 ccaaacatgcatcaccccgcaccaggccttcctccgcntcacccccggaaccgcacctccttcaccattccccctaacga 720 
721 acaacccaaactagccccgcctacaaccgacagcagcagaaagaccggcgatcccacccgtgcaacgttatcagcrtaaa 800 
301 cacac:tcgcaaaacatc:atcagccaccaccatacaaaaaaaatcctataactacaaacaccaaccaacat:ctgcggcc 880 
S81 accacrt:aLttaaaacgccacgaccagt:aggacaccrrgcatacacatagccacgcttaatggccacttgcaaccr,gc 958 

9 59 ATG ACC GAA CAC CAT ACC CCC AAA AGC AGG ATT CTT CGC TTT CTA GAG AAT CAA TAT GTA 1018 

1 M T £ H H T ? K S R I C- Ft r L E U Q ? V 20 

10 19 TAC CTA TGT ACC TTA AAT GAT TAT GTA CAA CTT GTT TTG AGA GGG TCC CCG CCA AGC TCG 1078 
21 Y L C T L M Q V V Q L V L R G S ? A S S 40 

107? TAT AGC AAT ATA TCC CAA CCC TTG AGA AGC GAT GTA CAA ACG TCC TTT TCT ATT TTT' CTT ^13 3 
^ 1 Y s N I C Z R L 3. S D V Q 7 S F S Z F L 60 

113 9 CAT TCG ACT GTA GTC GGC TTC GAC AGT AAG CCA GAT GAA GGT GTT CAA TTT TCT TCT' CCA 1138 
- 51 K s TVVGr D S K ? D EGVQ r S S ? 80 
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1129 AAA TCC TCA CAC TCA GAG g:acacacac::ri^cr::;ac:;:z::c:ac:c;g?acaqccaaca:a:c;gcao 1272 
31 >; c 5 ? s £ ' 86 

127 3 CTA ATA GCG AA7 GTT GTA AAA CAG ATG TTC GAT GAA AGT TTT GAG CGT CCA AGG AAT CTA 1332 
37 L I A M V V X Q r 0 £ S £ £ R R H .V C 1£36 

1333 CTG ATG AAA GGG TTT TCC ATG guaggcat:ccaacc?tcaaat:acc:accc?caaccac:c;::zaaacaga L405 
rQ7 - M X G t S M " ll3 

1406 ::gtac::aaccga:aaag AAT CAT GAA GAT TTT CGA GCC ATG' CAT GTA AAC GGA GTA CAA AAT 1469 
114 NHSDrRAMKV-WGVQN I2 8 

147 0 GAT CTC GTT TCT ACT TTT CCT AAT TAC CTT ATA TCT ATA CTT GAG TCA AAA AAT TGG CAA 1529 
129 D L V ST F ? NY L I S I L £ 2 K N Wg 143 

1530 CTT TTG TTA GAA AT gcaaacaccgg::aagacgzc?cgcactiz;aacaagaccgacaagcacag T ATC GGC 1601 

149 L L L £ I r G 155 

15 02 AGT GAT GCC ATG CAT TAC TTA TTA TCC AAA GGA AGT ATT TTT GAG CCT CTT CCA AAT GAC 166L 

-56 5 0 A M H Y L L S X G S I F £ A L ? N 0 L75 

fill 

, 3- £ a 2 AAT TAC CTT CAG ATT TCT GGC ATA CCA CTT TTT AAA AAT AAT GTG TTT GAG GAA ACT CTC 1721 

■■■ J 176 nt r l q r s g r ? l f :< m m v r £ £ t v 195 



. AAA AAA AGA AAG CGA ACC ATT GAA ACA TCC ATT ACT CAA AAT AAA AGC GCC CGC AAA 1781 



'< R X R T I £ T S I T Q M X S A R X 215 

■;!j.7a2 GAA GTT TCC TGG AAT AGC ATT TCA ATT AGT AGG TTT AGC ATT TTT TAC AGG TCA TCC TAT 1841 

lu 2is £ V s W M 3 r S t S R r s 1 ~ y r s s y 235 

i ; -;342 AAG AAG TTT AAG CAA G gcaac:aacac:gc:acccrrcacaaccaacc::ag AT CTA TAT TTT AAC 
I #907 

nj23 6 :-: :< r :< q d l y r m 245 

Cl 908 CAC AT7 TCT GAT CGG AAC ACA CTA CAC ATG TCG CTT CAA TGG ATT CCA AGG 1367 

ilh 246 lhs ^cor:;tvh«wlqw:f?r 265 

1958 CAA TTT GGA CTT ATA AAC GCA TTT CAA GTG AAG CAA TTG CAC AAA GTG ATT CCA CTC GTA 2027 

~SSQ F G L I N A F Q V X Q LHXVI?t-V 285 

20 28 TCA CAG AGT ACA GTT GTG CCC AAA CGT CTC CTA AAG GTA TAC CCT TTA ATT GAA CAA ACA 2087 

236SgSTVV?KRLLKVY?C.rrQT 305 

2 088 GCA AAG CGA CTC CAT CGT ATT TCT CTA TCA AAA GTT TAC AAC CAT TAT TGC CCA TAT ATT 2147 

306a X R LH R I S L S X V Y MH YC ? Y t 325 

214 8 GAC ACC CAC GAT GAT GAA AAA ATC CTT AGT TAT TCC TTA AAG CCG AAC CAG GTG TTT GCG 2207 

3 26 O T H D 0 £ X t L S Y S u X ? N Q V 7 A 345 

2208 TTT CTT CGA TCC ATT CTT CTT CCA GTG TTT CCT AAA TTA ATC TGG GOT AAC CAA AGG ATA 22 57 

346 F r, o s r L 7 R t/ F ? ... L ; w G M 0 R i 3 65 



i2 68 T.T CAG ATA ATA TTA AAA G gcacc3^acaaaac":a:raccaccaacc2::::accag AC CTC GAA ACT 23 3 6 
366 F Z : Z L X j u £ T 375 
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2337 TTC TTC AAA TTA TCC AGA TAG GAG TCT TTT AGT TTA CAT TAT TTA ATG AGT AAC ATA AAG 23 95 

375 f i :< - s r y z 5 r s h r - « s n r ;< 395 

2 3 97 ccaacacgccaaacccrtrracc^ccaaccaacaaccag ATT TCA GAA ATT GAA TGG CTA GTC CTT GGA 2 4 65 

2 95 i s z r £ w l </ l g 405 

2465 AAA AGG TCA AAT GCG AAA ATG TGC TTA AGT GAT TTT GAG AAA CGC AAG CAA ATA TTT GCG 2525 

' 406 K H S N A X M C L S D F Z X * :< Q I F A 425 

252 5 GAA TTC ATC TAC TGG CTA TAC AAT TCG TTT ATA ATA CCT ATT TTA CAA TCT TTT TTT TAT 25 35 

425 EFIYWLYNSFI I P I L Q S F F Y 445 

25 3 5 ATC ACT GAA TCA AGT GAT TTA CGA AAT CGA ACT GTT TAT TTT AGA AAA GAT ATT TGG AAA 2 54 5 

4461 T £ S S D L R M R T V Y F R X 0 I W X 465 

2 646 CTC TTC TGC CGA CCC TTT ATT ACA TCA ATG AAA ATG GAA GCG TTT GAA AAA -ATA AAC GAG 2705 

-65 L, - C R ? F I T S M X M £ A F £ X I N £ 485 

270 6 gcac^ttaaaccaccctctgcaaaaagcciacacccrcag AAC AAT CTT AGG ATG CAT ACT CAG AAA ACT 277$ 

4 86 N N V R M D T Q X T 495 

2 77 5 ACT TTC CCT CCA GCA GTT ATT CGT CTA TTA CCT .AAG AAG AAT ACC TTT CCT CTC ATT ACG 2 S3 5 

496 tlp?av:rll?x:<^tfrl:t 515 

2336 AAT TTA AGA AAA AGA TTC TTA ATA AAG gcaccaacccccggccaccaacgcacctcicccczaacccatta 2906 

516 N L R X R F L 1 X 524 

2907 -cagcag ATG GGT TCA AAC .AAA AAA ATC TTA GTC AGT ACC AAC CAA ACT TTA CCA OCT GTC 2967 

525 MGS NXXMLVSTtfQTLRPV 542 

2 9*5 3 GCA TCG ATA CTC AAA CAT TTA ATC AAT GAA GAA AGT AGT GGT ATT CCA TTT AAC TTG GAG 3 027 
543A S I L X H L I W E £ S S G I ? F M L E 562 

302a GTT TAC ATG AAG CTT CTT ACT TTT AAG AAG GAT CTT CTT AAG CAC CCA ATG TTT CG gcaac 3088 

5 53 V Y M X L L T F X X D L L X H R MFC 581 

3 089 :acacaacgcgcgactcctcactaccaaccc:gcag G CGT AAG AAG TAT TTT GTA CGG ATA GAT ATA 3155 

S32 R X X Y F V R Z 0 I 591 

3155 AAA TCC TCT TAT GAT CCA ATA AAG CAA GAT TTG ATG TTT CGG ATT GTT AAA AAG AAA CTC 3215 

592 X S C Y 0 R I X Q D L M F R r V X X K'L 511 

3 215 AAG GAT CCC GAA TTT GTA ATT CGA AAG TAT GCA ACC ATA CAT GCA ACA AGT GAC CGA CCT 3 275 

612 X 0 P EF VI R XY AT I HAT'SDRA 631 

3276 ACA AAA AAC TTT GTT AGT GAG GCG TTT TCC TAT T gcaagcczacccccccactggaacuccccaacaa 3343 

532 T X M F V S £ A F S Y F 64 3 

3344 dc:c:c:mag TT CAT ATG CTC CCT TTT GAA AAA GTC GTC CAG TTA CTT TCT ATG AAA ACA 3405 

544 O « V ? F £ X ^ V Q L L S M X T 659 

3 406 7CA GAT ACT TTC TTT GTT GAT TTT CTC GAT TAT TGG ACC AAA AGT TCT TCT GAA ATT TTT 3 4 65 

5 50 - J 7 L F D ~ V D Y W T X S S 5 Z I F 579 

3 4 66 AAA ATG CTC AAG GAA CAT CTC TCT GGA CAC ATT GTT AAG gcacaccaaccgc:;aaccg:aacaaca 3 5 32 

S30 X M 1. K Z. H L 5 G H Z V X 592 
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3 53 3 c:aaccadac:ag ATA GGA AAT TCT CAA TAG CTT CAA AAA GTT GGT ATC CCT CAG GGG TCA 3 593 

593 eg U S Q Y „ L Q '< V G I ? Q C S 708 

3^94 ATT CTC TCA TCT TTT TTG TGT CAT TTC TAT ATG GAA GAT TTC ATT GAT GAA TAC CTA TCC 3 653 

7091 LSSrLCn ? '{ X Z 0 L I D E Y L S 728 

3 55 4 TTT ACG AAA AAG AAA GGA TCA GTG TTC TTA CGA GTA GTC GAC GAT TTC CTC TTT ATA ACA 3713 

729 f t :< :< :< g s v l lrvv doflfit 748 

3 7 14 GTT AAT AAA AAG GAT GCA AAA AAA TTT TTG AAT TTA TCT TTA AGA G gcgagct?ci?tc:attcc 3777 

749 V M :< :< D A K :< F L N L S L R G 764 

3773 caaqccc^accgct^aag GA TTT GAG AAA CAC AAT TTT TCT ACG AGC CTC GAG AAA ACA GTA 3840 



:S5 



T S L Z K T V 778 



3 041 ATA AAC TTT GAA AAT AGT AAT GGG ATA ATA AAC AAT ACT TTT TTT AAT GAA AGC AAG AAA 3 900 



113 I U F Z X 



798 



3 901 AGA ATG CCA TTC TTC GGT TTC TCT GTG AAC ATG AGG TCT CTT GAT ACA TTG TTA CCA TCT 3 960 

7 99 = M ? F F G F 5 V/ X M R S t- O T L L A C 818 

395 1 CCT AAA ATT GAT GAA CCT TTA TTT AAC TCT ACA TCT GTA GAG CTC ACG AAA CAT ATG GGG 4020 

319 ? * ! D £ A L F M 5 T S V £ L T .< H M G 838 

4021 .AAA TCT TTT TTT TAC AAA * ATT CTA AG gcacaccgcgcaaccgaacaacagccgacaaacaa tcag A TCG 4089 

- 3 39 K 3 F F Y :< I L R s 348 

4090 AGC CTT GCA TCC TTT GCA CAA GTA TTT ATT GAC ATT ACC CAC AAT TCA AAA TTC AAT TCT 4149 

349 S LAS FAQ V F I D I T H N S X FN S 868 

4LSjD TGC TGC AAT ATA TAT AGG CTA GGA TAC TCT ATG TCT ATC AGA GCA CAA GCA TAC TTA AAA 4 209 

369 C C N I Y R L G Y S ft C rt * A Q A Y L :< 388 

4 210 AGG ATG AAG GAT ATA TTT ATT CCC CAA AGA ATG TTC ATA ACG G gcgagcaccractrcaaccaga 4 27 4 



389 



4275 aaagccac:aaccaacc::ag AT CTT TTC AAT GTT ATT GGA AGA AAA ATT TGG AAA AAG TTG GCC 4339 

904 * L L N V I G R K I W K X L A 917 

4 340 GAA ATA TTA jGGA TAT ACC AGT AGG CGT TTC TTC TCC TCT GCA GAA GTC AAA TC gcacgcgtc 4 401 
918 ETC. CYTSRRFLSSAEVKW 935 

4402 ggccccgagacctcagcaatattgacacaccag G CTT TTT TGT CTT GGA ATC AGA GAT CGT TTG AAA 4468 
936 LFCLGMRDGLK 946 



4 4 69 CCC TCT TTC AAA TAT CAT CCA TGC TTC GAA CAG CTA ATA TAC CAA TTT CAG TCA TTC ACT 
947?S"KYHPCr£QCr YQFQSt-T 



4S28 
966 



4 529 GAT CTT ATC AAG CCC CTA AGA CCA GTT TTC CCA CAG GTC TTA TTT TTA CAT AGA AGA ATA 4 58B 
967 DLIKPLRPVLaQ'/LFLH?. 31 " 6 

4589 CCT GAT TAA cgticacc- -dace :^::acacacaccc: ::accaccggcc:c::aaacaacaccac:accaagcaca 466S 

989 

987 A 3 
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4 666 gcrqacccz22aagcaagcacaccacaggacc:c:accaaagcaaaaccaacc-=?ccacc3qcc::?atcgac::z?ccc 4 74 5 

4 74 6 ccacccc^atactctcaagaaagacLgacagcggccgczgaccaccgcccacacgcccaticaaacgggaQcggc-aaaca 4 82S 

4 82 6 c:aaaagcaacacatgaggccaacciccc:zcact:ragaacaaggaaagtggc:c:ccacaacgaacaacgcccgcacca 4 90S 

4 90 6 dCgcaaaaagacgaagactaLCCiczaaacaagggggaccaagcacacccgaaggaaaagagagcaa cacacccagcgcc 4 9 85 

4 98 6 cccgaagaaagcaaggacaactrggaacaagcctccgcagacgacaggccaaaccczggcgaccgaacczrggcaaaagc 5065 
50 6 6 ccc3ggccacccacggcggccggccc:gccaccgagacgaaaagaaaccaaggacagctcgaacaccaacagcr;:accca 5145 
514 6 acgcctcacacaaggccrcgcLcrcccczcraccccaacrcsgcacgggcgaaaagaaacagcgctaagccatcaccggat: 5225 

5 225 cccgaaatagccaaaccccccggccccccaaagcggaagccraaagaacczzccgaagcctatgaggcctcaaaaacccc 5 3 05 
5306 ccccgaccraaaggaggaacccrccaccgacgaggaaacggacagccraccagccgccgaggagaagcczaacctccwgc 5 3 85 
53 8 6 aaaaaagaaaacaccaccgggagacacczccrgacgaaccagacgcggagagcacccccagcggacccc:gacgccaaca 54 6 5 
5 46 6 acccccaccrccgaaacgcacggccccaccgccgczccgaccrccrgcagccccacccagctaagcgaccaaaggcacc 5 544 
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FIGURE 16 

-page 1 
(Seq. ID. No. 1) 

1 gcagcgcrgc gtcctgctgc gcacgtggga agccctggcc ccggccaccc ccgcgatgcc 
61 gcgcgctccc cgctgccgag ccgtgcgctc cctgctgcgc agccactacc gcgaggtgct 
121 gccgctggcc acgttcgtgc ggcgcctggg gccccagggc tggcggctgg tgcagcgcgg 
181 ggacccggcg gctttccgcg cgctggtggc ccagtgcctg gtgtgcgtgc cctgggacgc 
241 acggccgccc cccgccgccc cctccttccg ccaggtgtcc tgcctgaagg agctggtggc 
3 01 ccgagtigctg cagaggctgt gcgagcgcgg cgcgaagaac gtgctggcct tcggcttcgc 
361 gctgctggac ggggcccgcg ggggcccccc cgaggccttc accaccagcg tgcgcagcta 
421 cctgcccaac acggtgaccg acgcactgcg ggggagcggg gcgtgggggc tgctgctgcg 
481 ccgcgtgggc gacgacgtgc tggtccacct gctggcacgc tgcgcgctct tfcgtgctggt 
541 ggctcccagc tgcgcctacc aggtgtgcgg gccgccgctg taccagctcg gcgctgccac 
601 tcaggcccgg cccccgccac acgctagcgg accccgaagg cgtctgggat gcgaacgggc 
661 ctggaaccat agcgtcaggg aggccggggn ccccccgggc ctgccagccc cgggtgcgag 
721 gaggcgcggg ggcagtgcca gccgaagtct gccgttgccc aagaggccca ggcgtggcgc 
781 tgcccctgag ccggagcgga cgcccgttgg gcaggggtcc tgggcccacc cgggcaggac 
841 gcgtggaccg agcgaccgtg gcttctgtgt ggtgccacct gccagacccg ccgaagaagc 
901 cacctctttg gagggtgcgc tctctggcac gcgccactcc cacccatccg tgggccgcca 
961 gcaccacgcg ggccccccac ccacatcgcg gccaccacgt ccctgggaca cgccttgtcc 
1021 cccggtgtac gccgagacca agcacttcct ctactcctca ggcgacaagg agcagctgcg 
10 81 gccccccttc ctactcagct ctctgaggcc cagcctgact ggcgcccgga ggctcgtgga 
1141 gaccatcttt ctzgggctcca ggccctggat gccagggact ccccgcaggt tgccccgcct 
1201 gccccagcgc tactggcaaa cgcggcccct gtrttctggag ctgcttggga accacgcgca 
1261 gtgcccctac ggggtgctcc tcaagacgca ctgcccgctg cgagccgcgg tcaccccagc 
1321 agccggtgtc tgtgcccggg agaagcccca gggctctgtg gcggcccccg aggaggagga 
1381 cacagacccc cgtcgccngg tgcagctgct ccgccagcac agcagcccct ggcaggtgta 
1441 cggcttcgtg cgggccrgcc tgcgccggcr ggtgccccca ggcctctggg gctccaggca 
1501 caacgaacgc cgcttcctca ggaacaccaa gaagttcatc tccctgggga agcatgccaa 
1561 gctctcgctg caggagctga cgnggaagat gagcgtgcgg gactgcgctt ggctgcgcag 
1621 gagcccaggg gttggccgtg ttccggccgc agagcaccgt ctgcgtgagg agatcctggc 
1681 caagttcctg cactggccga tgagtgtgca cgtcgtcgag ctgctcaggt ctttctttta 
1741 tgtcacggag accacgcttc aaaagaacag gctctttttc taccggaaga gtgtctggag 
1801 caagttgcaa agcantggaa tcagacagca cttgaagagg gtgcagctgc gggagctgtc 
1861 ggaagcagag gtcaggcagc atcgggaagc caggcccgcc ccgccgacgt: ccagactccg 
1921 cttcatcccc aagccngacg ggctgcggcc gattgtgaac atggactacg tcgtgggagc 
1981 cagaacgttc cgcagagaaa agagggccga gcgtctcacc tcgagggcga aggcactgtt 
2041 cagcgtgctc aactacgagc gggcgcggcg ccccggcctc ctgggcgcct ctgtgctggg 
2101 cctggacgat atccacaggg cctggcgcac cttcgtgctg cgtgtgcggg cccaggaccc 
2161 gccgcctgag ctgtactttg tcaaggtgga tgtgacgggc gcgtacgaca ccatccccca 
2221 ggacaggctc acggaggcca tcgccagcat catcaaaccc cagaacacgt actgcgtgcg 
2231 tcggtatgcc gtggtccaga aggccgccca tgggcacgtc cgcaaggcct tcaagagcca 
2341 cgtctctacc ttgacagacc tccagccgta catgcgacag ttcgtggctrc acctgcagga 
2401 gaccagcccg ctgagggatg ccgtcgtcat cgagcagagc tcctccctga atgaggccag 
2461 cagtggcctc ttcgacgtct tcctacgctt catgtgccac cacgccgtgc gcatcagggg 
2521 caagtcctac gtccagtgcc aggggatccc gcagggctcc atcctctcca cgctgctctg 
25 Bl cagcccgtgc tacggcgaca tggagaacaa gctgtttgcg gggattcggc gggacgggct 
2641 gctcctgcgt ttggtggacg atttcttgtt ggtgacaccc cacctcaccc acgcgaaaac 
2701 cttcctcagg accctggccc gaggtgtccc tgagcatggc tgcgtggtga acttgcggaa 
2761 gacagtggtg aacttccctzg cagaagacga ggccctgggt ggcacggctt ttgttcagat 
2821 gccggcccac ggcctatccc cctggtgcgg ccrgccgctg gatacccgga ccctggaggt: 
2881 gcagagcgac tactccagct atgcccggac ctccatcaga gccagtctca ccttcaaccg 
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2941 cggcttcaag gctgggagga acatgcgtcg caaactcttt ggggtictitgc ggctgaagtg 

3001 tcacagccrg tttctggact tgcaggtgaa cagccrccag acggtgtgca ccaacatcta 

3 061 caaganccrc ctgctgcagg cgcacaggtt. tcacgcatgt gtgctgcagc tcccatttca 

3121 tcagcaagtt tggaagaacc ccacattttt cctgcgcgtc atctctgaca cggcctccct: 

3181 ctgctactcc atcctgaaag ccaagaacgc agggatgtcg ctgggggcca agggcgccgc 

3 241 cggcccrctg ccctccgagg ccgtgcagtg gctgtgccac caagcactcc tgctcaagct 

3 301 gactcgacac cgtgtcacct acgtgccact cctggggtca ctcaggacag cccagacgca 

3 361 gctgagtcgg aagctcccgg ggacgacgct gactgccctg gaggccgcag ccaacccggc 

3 421 actgccctca gacttcaaga ccatcctgga ctgatggcca cccgcccaca gccaggccga 

3481 gagcagacac cagcagccct gtcacgccgg gctctacgtc ccagggaggg aggggcggcc 

3 541 cacacccagg cccgcaccgc tgggagtctg aggccrgagt gagtgtttgg ccgaggcctg 

3 601 catgtccggc tgaaggccga gtgtccggct gaggcctgag cgagtgtcca gccaagggcn 

3 661 gagtgtccag cacacctgcc gtcttcactt ccccacaggc tiggcgctcgg ctccacccca 

3 721 gggccagctt ttcctcacca ggagcccggc ttccaccccc cacataggaa tagtccaccc 

3781 ccagattcgc catcgttcac ccctcgccct gccctccttt gccttccacc cccaccatcc 

3 841 aggtggagac cctgagaagg accctgggag ctctgggaat ttggagtgac caaaggtgtg 

3 901 ccctgtacac aggcgaggac cctgcacctg gatgggggrc cctgtgggcc aaattggggg 

3 961 gaggtgccgn gggagtaaaa tactgaacat atgagttttt cagttttgaa aaaaa 



# 



FIGURE 17 
HUMAN TRT PROTEIN SEQUENCE 
(SEQ. NO. 2) 

MPRAPRCRAVRSLLRSHYREVLPLATFVRRLGPQGWRLVQRGDP 

AAFRALVAQCLVCVPWDARPPPAAPSFRQVSCLKELVARVLQRLCERGAKNVLAFGFA 
LLDGARGGPPEAFTTSWSYLPNTVTDALRGSGAWGLLLRRVGDDVLVHLIJVRCALFV 
LVAP S CAYQVCGP PLYQLGAATQARP P PHAS GPRRRLGCERAWNHS VREAGVPLGLPA 
PGARRRGGSASRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCWSPA 
RPAEEATSLEGALSGTRHSKPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHFLYS 
SGDKEQLRPSFLLSSLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPL 
FLELLGNHAQCPYGVLLKTKCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRRLVQ 
LLRQHSSPWQWGFVRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQEL 
TWKMSVRDC^WLRRSPGVGCVPAAEKRLREEI 

TFQKNRLFFYRKSWSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFI 
PKPDGLRPIVNMDYVVGARTFRREKRAERLTSRVKAIiFSV^ 

LDD I HRAWRTFVLRVRAQDP P PELYFVKVDVTGAYDT I PQDRLTEVI AS 1 1 KPQNTYC 
VRRYAVVQKAAHGHVRKAFKSHVSTLTDLQ 

NEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDMENKLFAG 
IRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVPEYGCYVNLRKTVVNFPVEDEAL 
GGTAFVQMPAHGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGRNMRR 
KLFGVLRLKCTSLFLDLQWSLQTVCTNIYKILLLQAYRFHAC^QLPFHQQVWKNPT 
FFLRV I S DTAS LC YS I LKAKNAGM S LG AKG AAG P LP S E AVQ WLCHQ AFLL KLTRHRVT 
YVPLLGS LRTAQTQLS RKLPGTTLTALEAAANP AJLP SDFKT I LD 



FIGURE 18 

Clone 712562 
(SEQ ID NO. 3 ) 

GGCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTT 
TTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTG 
GAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCT 
GTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACT 
CCGCTTCATCCCCAAGCCTGACGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGG 
AGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACT 
GTTCAGCGTGCTCAACTACGAGCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCT 
GGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGA 
CCCGCCGCCTGAGCTGTACTTTGTCAAGGTGGATGTGACGGGCGCGTACGACACCZATCCC 
CCAGGACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGT 
GCGTCGGTATGCCGTGGTCCAGAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAG 
CCACGTCCTACGTCCAGTGCCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCT 
GCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGC 
TGCTCCTGCGTTTGGTGGATGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAA 
CCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGA 
AGACAGTGGTGAACTTCCCTGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGA 
TGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGG 
TGCAGAGCGACTACTCC^GCTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACC 
GCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTGAAGT 
GTCACAGCCTGTTTCTGGATTTGCAGGTGAACAGCCTCCAGACGGTGTGCZACCAACZATCT 
ACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCATTTC 
ATCAGCAAGTTTGGAAGAACCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCCTCCC 
TCTGCTACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCG 
CCGGCC7TCTGCCCTCCGAGGCCGTGCAGTGGCTGTGCCACCAAGCATTCCTGCTCAAGC 
TGACTCGACACCGTGTCACCTACGTGCCACTCCTGGGGTCACTCAGGACAGCCCAGACGC 
AGCTGAGTCGGAAGCTCCCGGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAACCCGG 
CACTGCCCTCAGACTTCAAGACCATCCTGGACTGATGGCCZACCCGCCCACZAGCCAGGCCG 
AGAGCAGACACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGC 
CCACACCCAGGCCTGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCT 
GCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGC 
TGAGTGTCCAGCACACCTGCCGTCTTCACTTCCC 

AGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATC 
CCC^GATTCGCCATTGTTCACCCCTCGCCCTGCCCTCCTTTGCCTTCCACCCCCACCATC 
CAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGT 
GCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGG 
GGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTTTTTC^GTTTTGOAAAAAAAAAA 
AAAAAAAAAAAAAAAA 



FIGURE 19" 

SEQ ID NO. 10 
MetSerValTyrValValGluLeuLeuArgSerPhePhe 

TyrValThrGluThrThrPheGlnLysAsnArgLeuPhePheTyrArgLysSerValTrp 
SerLysLeuGlnSerlleGlylleArgGlnHisLeuLysArgValGlnLeuArgGluLeu 
SerGluAlaGluValArgGlnHisArgGluAlaArgProAlaLeuLeuThrSerArgLeu 
ArgPhelleProLysProAspGlyLeiiArgProIleValAsnMetAspTyrValValGly 
AlaArgThrPheArgArgGluLysArgAlaGluArgLeuThrSerArgValLysAlaLeu 
PheSerValLeuAsnTyrGluArgAlaArgArgProGlyLeuLeuGlyAlaSerValLeu 
GlyLeuAspAspIleHisArgAlaTrpArgThrPheValLeuArgValArgAlaGlnAsp 
ProProProGluLeuTyrPheValLysValAspValThrGlyAlaTyrAspThrllePro 
GlnAspArgLeuThrGluVallleAlaSerllelleliysProGlnAsnThrTyrCysVal 
ArgArgTyrAlaValValGinLysAlaAlaHisGlyHisValArgLysAlaPheLysSer 
HisValLeuArgProValProGlyAspProAlaGlyLeuHisProLeuHisAlaAlaLeu 
GlnProValLeuArgArgHisGlyGluGlnAlaValCysGlyAspSerAlaGlyArgAla 
AlaProAlaPheGlyGly 



# 
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SEQUENCE NO. 4 (DNA) AND SEQUENCE NO. 5 (PROTEIN) 
(TRANSLATION OF A A182 hTRT VARIANT) 



1 

met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10 

pro arg ala pro arg cys arg ala val arg ser leu leu arg ser 
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC 

20 30 
his cyr arg glu val leu pro leu ala thr phe val arg arg leu 
CAC TAC CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG 

40 

gly pro gin gly trp arg leu val gin arg gly asp pro ala ala 
GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT 

50 60 

phe arg ala leu val ala gin cys leu val cys val pro trp asp 
TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC 



ala arg pro pro pro ala ala 
GCA CGG CCG CCC CCC GCC GCC 

80 

leu lys glu leu val ala arg 
CTG AAG GAG CTG GTG GCC CGA 



gly ala lys asn val leu ala 

GGC GCG AAG AAC GTG CTG GCC 

110 

ala arg gly gly pro pro glu 

GCC CGC GGG GGC CCC CCC GAG 



70 

pro ser phe arg gin val ser cys 
CCC TCC TTC CGC CAG GTG TCC TGC 

90 

val leu gin arg leu cys glu arg 
GTG CTG CAG AGG CTG TGC GAG CGC 

100 

phe gly phe ala leu leu asp gly 
TTC GGC TTC GCG CTG CTG GAC GGG 

120 

ala phe thr thr ser val arg ser 
GCC TTC ACC ACC AGC GTG CGC AGC 



tyr leu pro asn thr val thr 

TAC CTG CCC AAC ACG GTG ACC 

140 

trp gly leu leu leu arg arg 

TGG GGG CTG CTG CTG CGC CGC 



leu leu ala arg cys ala leu 
CTG CTG GCA CGC TGC GCG CTC 

170 

ala cyr gin val cys gly pro 
GCC TAC CAG GTG TGC GGG CCG 



thr gin ala arg pro pre pro 

ACT CAG GCC CGG CCC CCG CCA 

200 

leu gly cys glu arg ala trp 

CTG GGA TGC GAA CGG GCC TGG 

val pro leu gly leu pro ala 

GTC CCC CTG GGC CTG CCA GCC 

230 

ser ala ser arg ser leu pro 

AGT GCC AGC CGA AGT CTG CCG 



ala ala pro glu pro glu arg 

GCT GCC CCT GAG CCG GAG CGG 

260 

ala his pro gly arg thr arg 

GCC CAC CCG GGC AGG ACG CGT 
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130 

asp ala leu arg gly ser gly ala 
GAC GCA CTG CGG GGG AGC GGG GCG 

150 

val gly asp asp val leu val his 
GTG GGC GAC GAC GTG CTG GTT CAC 

160 

phe val leu val ala pro ser cys 
TTT GTG CTG GTG GCT CCC AGC TGC 

180 

pro leu tyr gin leu gly ala ala 
CCG CTG TAC CAG CTC GGC GCT GCC 

190 

his ala ser gly pro arg arg arg 
CAC GCT AGT GGA CCC CGA AGG CGT 

210 

asn his ser val arg glu ala gly 
AAC CAT AGC GTC AGG GAG GCC GGG 

220 

pro gly ala arg arg arg gly gly 
CCG GGT GCG AGG AGG CGC GGG GGC 

240 

leu pro lys arg pro arg arg gly 
TTG CCC AAG AGG CCC AGG CGT GGC 

250 

thr pro val gly gin gly ser trp 
ACG CCC GTT GGG CAG GGG TCC TGG 

270 

gly pro ser asp arg gly phe cys 
GGA CCG AGT GAC CGT GGT TTC TGT 
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val val ser pro ala arg pro 
GTG GTG TCA CCT GCC AGA CCC 

290 

gly ala leu ser gly thr arg 
GGT GCG CTC TCT GGC ACG CGC 

gin his his ala gly pro pro 
CAG CAC CAC GCG GGC CCC CCA 

320 

trp asp chr pro cys pro pro 
TGG GAC ACG CCT TGT CCC CCG 



leu tyr ser ser gly asp lys 
CTC TAC TCC TCA GGC GAC AAG 

350 

leu ser ser leu arg pro ser 
CTC AGC TCT CTG AGG CCC AGC 



glu thr ile phe leu gly ser 

GAG ACC ATC TTT CTG GGT TCC 

380 

arg arg leu pro arg leu pro 

CGC AGG TTG CCC CGC CTG CCC 

leu phe leu glu leu leu gly 

CTG TTT CTG GAG CTG CTT GGG 

410 

val leu leu lys thr his cys 

GTG CTC CTC AAG ACG CAC TGC 



280 

ala glu glu ala thr ser leu glu 
GCC GAA GAA GCC ACC TCT TTG GAG 

300 

his ser his pro ser val gly arg 
CAC TCC CAC CCA TCC GTG GGC CGC 

310 

ser thr ser arg pro pro arg pro 
TCC ACA TCG CGG CCA CCA CGT CCC 

330 

val tyr ala glu thr lys his phe 
GTG TAC GCC GAG ACC AAG CAC TTC 

340 

glu gin leu arg pro ser phe leu 
GAG CAG CTG CGG CCC TCC TTC CTA 

360 

leu thr gly ala arg arg leu val 
CTG ACT GGC GCT CGG AGG CTC GTG 

370 

arg pro trp met pro gly thr pro 
AGG CCC TGG ATG CCA GGG ACT CCC 

390 

gin arg tyr trp gin met arg pro 
CAG CGC TAC TGG CAA ATG CGG CCC 

400 

asn his ala gin cys pro tyr gly 
AAC CAC GCG CAG TGC CCC TAC GGG 

420 

pro leu arg ala ala val thr pro 
CCG CTG CGA GCT GCG GTC ACC CCA 

430 
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ala cys leu 
GCC TGC CTG 



his asn glu 
CAC AAC GAA 



leu gly lys 
CTG GGG AAG 



met ser val 
ATG AGC GTG 



giy cys val 
GGC TGT GTT 



ala lys phe 
GCC AAG TTC 



470 

arg arg leu 
CGC CGG CTG 



arg arg phe 
CGC CGC TTC 

500 

his ala lys 
CAT GCC AAG 



arg asp cys 
CGG GAC TGC 

530 

pro ala ala 
CCG GCC GCA 



leu his trp 
CTG CAC TGG 



val pro pro 
GTG CCC CCA 

490 

leu arg asn 
CTC AGG AAC 



leu ser leu 
CTC TCG CTG 

520 

ala trp leu 
GCT TGG CTG 



glu his arg 
GAG CAC CGT 

550 

leu met ser 
CTG ATG AGT 



pro gin gly 
CCC CAG GGC 



arg arg leu 
CGT CGC CTG 



val tyr gly 
GTG TAC GGC 



gly leu trp 
GGC CTC TGG 



thr lys lys 
ACC AAG AAG 



gin glu leu 
CAG GAG CTG 



arg arg ser 
CGC AGG AGC 



leu arg glu 
CTG CGT GAG 



val tyr val 
GTG TAC GTC 



ser val ala 
TCT GTG GCG 

450 

val gin leu 
GTG CAG CTG 



phe val arg 
TTC GTG CGG 

480 

gly ser arg 
GGC TCC AGG 

phe ile ser 
TTC ATC TCC 

510 

thr trp lys 
ACG TGG AAG 



pro gly val 
CCA GGG GTT 

540 

glu ile leu 
GAG ATC CTG 



val glu leu 
GTC GAG CTG 



ala ala gly val 
GCA GCC GGT GTC 



ala pro glu 
GCC CCC GAG 



leu arg gin 
CTC CGC CAG 



cys ala 
TGT GCC 

440 

glu glu asp 
GAG GAG GAC 



his ser ser 
CAC AGC AGC 



arg glu lys 
CGG GAG AAG 



thr asp pro 
ACA GAC CCC 

460 

pro trp gin 
CCC TGG CAG 



thr thr phe 
ACC ACG TTT 



570 

gin lys asn 
CAA AAG AAC 



giy cys val pro ala ala glu his arg 
GGC TGT GTT CCG GCC GCA GAG CAC CGT 

550 

ala lys phe leu his trp leu met ser 
GCC AAG TTC CTG CAC TGG CTG ATG AGT 

560 

leu arg ser phe phe tyr val thr glu 
CTC AGG TCT TTC TTT TAT GTC ACG GAG 



580 

arg leu phe phe tyr arg lys ser val trp ser lys leu gin ser 
AGG CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC 




590 

ile gly ile arg gin his leu 

ATT GGA ATC AGA CAG CAC TTG 

ser glu ala glu val arg gin 

TCG GAA GCA GAG GTC AGG CAG 

620 

leu thr ser arg leu arg phe 

CTG ACG TCC AGA CTC CGC TTC 



pro ile val asn met asp tyr 

CCG ATT GTG AAC ATG GAC TAC 

650 

arg glu lys arg ala glu arg 

AGA GAA AAG AGG GCC GAG CGT 

phe ser val leu asn tyr glu 

TTC AGC GTG CTC AAC TAC GAG 

680 

gly ala ser val leu gly leu 

GGC GCC TCT GTG CTG GGC CTG 

thr phe val leu arg val arg 

ACC TTC GTG CTG CGT GTG CGG 

710 

tyr phe val lys val asp val 

TAC TTT GTC AAG GTG GAT GTG 



gin asp arg leu thr glu val 
CAG GAC AGG CTC ACG GAG GTC 
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600 

lys arg val gin leu arg glu leu 
AAG AGG GTG CAG CTG CGG GAG CTG 

610 

his arg glu ala arg pro ala leu 
CAT CGG GAA GCC AGG CCC GCC CTG 

630 

ile pro lys pro asp gly leu arg 
ATC CCC AAG CCT GAC GGG CTG CGG 

640 

val val gly ala arg thr phe arg 
GTC GTG GGA GCC AGA ACG TTC CGC 

660 

leu thr ser arg val lys ala leu 
CTC ACC TCG AGG GTG AAG GCA CTG 

670 

arg ala arg arg pro gly leu leu 
CGG GCG CGG CGC CCC GGC CTC CTG 

690 

asp asp ile his arg ala trp arg 
GAC GAT ATC CAC AGG GCC TGG CGC 

700 

ala gin asp pro pro pro glu leu 
GCC CAG GAC CCG CCG CCT GAG CTG 

720 

thr gly ala tyr asp thr ile pro 
ACG GGC GCG TAC GAC ACC ATC CCC 

730 

ile ala ser ile ile lys pro gin 
ATC GCC AGC ATC ATC AAA CCC CAG 
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740 750 
asn thr tyr cys val arg arg tyr ala val val gin lys ala ala 
AAC ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC 

760 

his gly his val arg lys ala phe lys ser his val leu arg pro 
CAT GGG CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC CTA CGT CCA 

770 780 
val pro gly asp pro ala gly leu his pro leu his ala ala leu 
GTG CCA GGG GAT CCC GCA GGG CTC CAT CCT CTC CAC GCT GCT CTG 

790 

gin pro val leu arg arg his gly glu gin ala val cys gly asp 
CAG CCT GTG CTA CGG CGA CAT GGA GAA CAA GCT GTT TGC GGG GAT 

800 807 
ser ala gly arg ala ala pro ala phe gly gly OP 

TCG GCG GGA CGG GCT GCT CCT GCG TTT GGT GGA TGA TTTCTTGTTGGT 
GACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGA 
GTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCCTGTAGAAGACGAGGC 
CCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCT 
GCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGACTACTCCAGCTATGCCCGGACCTC 
CATCAGAGCCAGTCTCACCTTCAACCGCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAA 
ACTCTTTGGGGTCTTGCGGCTGAAGTGTCACAGCCTGTTTCTGGATTTGCAGGTGAACAG 
CCTCCAGACGGTGTGCACCAACATCTACAAGATCCTCCTGCTGCAGGCGTACAGGTTTC^ 
CGCATGTGTGCTGCAGCTCCCATTTCATCAGCAAGTTTGGAAGAACCCGAGATTTTTCCT 
GCGCGTCATCTCTGACACGGCCTCCCTCTGCTACTCCATCCTGAAAGCCAAGAACGCAGG 
GATGTCGCTGGGGGCCAAGGGCGCCGCCGGCCCTCTGCCCTCCGAGGCCGTGCAGTGGCT 
GTGCCACCAAGCATTCCTGCTCAAGCTGACTCGACACCGTGTCACCTACGTGCCACTCCT 
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GGGGTCACTCAGGACAGCCCAGACGCAGCTGAGTCGGAAGCTCCCGGGGACGACGCTGAC 
TGCCCTGGAGGCCGCAGCCAACCCGGCACTGCCCTCAGACTTCAAGACCATCCTGGACTG 
ATGGCCACCCGCCCACAGCCAGGCCGAGAGCAGACACCAGCAGCCCTGTCACGCCGGGCT 
CTACGTCCCAGGGAGGGAGGGGCGGCCCACACCCAGGCCCGCACCGCTGGGAGTCTGAGG 
CCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAG 
GCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCCGTCTTCACTTCCC 
CACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTC 
CACTCCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCCTCGCCCTGCC 
CTCCTTTGCCTTCCACCCCCACCATCCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTC 
TGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGAT 
GGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATG 
AGTTTTTCAGTTTTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 21" 

Genomic DNA insert of pGRN144 

Seq. ID. No. 6 

1 CCATGGGACCCACTGCAGGGGCAGCTGGGAGGCTGCAGGCTTCAGGTCCCAGTGGGGTTG 
GGTACCCTGGGTGACGTCCCCGTCGACCCTCCGACGTCCGAAGTCCAGGGTCACCCCAAC 

6 1 CCATCTGCCAGTAGAAACCTGATGTAGAATCAGGGCGCGAGTGTGGACACTGTCCTGAAT 
GGTAGACGGTCATCTTTGGACTACATCTTAGTCCCGCGCTCACACCTGTGACAGGACTTA 

121 CTCAATGTCTCAGTGTGTGCTGAAACATGTAGAAATTAAAGTCCATCCCTCCTACTCTAC 
GAGTTACAGAGTCACACACGACTTTGTACATCTTTAATTTCAGGTAGGGAGGATGAGATG 

181 TGGGATTGAGCCCCTTCCCTATCCCCCCCCAGGGGCAGAGGAGTTCCTCTCACTCCTGTG 
ACCCTAACTCGGGGAAGGGATAGGGGGGGGTCCCCGTCTCCTCAAGGAGAGTGAGGACAC 

241 GAGGAAGGAATGATACTTTGTTATTTTTCACTGCTGGTACTGAATCCACTGTTTCATTTG 
CTCCTTCCTTACTATGAAACAATAAAAAGTGACGACCATGACTTAGGTGACAAAGTAAAC 



**************************************** 
3 01 TTGGTTTGTTTGTTTTGTTTTGAGAGGCGGTTTCACTCTTGTTGCTCAGGCTGGAGGGAG 
AACCAAACAAACAAAACAAAACTCTCCGCCAAAGTGAGAACAACGAGTCCGACCTCCCTC 



*********** ************************************************* 
361 TGCAATGGCGCGATCTTGGCTTACTGCAGCCTCTGCCTCCCAGGTTCAAGTGATTCTCCT 
ACGTTACCGCGCTAGAACCGAATGACGTCGGAGACGGAGGGTCCAAGTTCACTAAGAGGA 

alu 

^^^t** ******************************************** ****** 
421 GCTTCCGCCTCCCATTTGGCTGGGATTACAGGCACCCGCCACCATGCCCAGCTAATTTTT 
CGAAGGCGGAGGGTAAACCGACCCTAATGTCCGTGGGCGGTGGTACGGGTCGATTAAAAA 



************************************************************ 
481 TGTATTTTTAGTAGAGACGGGGGTGGGGGTGGGGTTCACCATGTTGGCCAGGCTGGTCTC 
ACATAAAAATCATCTCTGCCCCCACCCCCACCCCAAGTGGTACAACCGGTCCGACCAGAG 

CAP 



************************************************************ 
541 GAACTTCTGACCTCAGATGATCCACCTGCCTCTGCCTCCTAAAGTGCTGGGATTACAGGT 
CTTGAAGACTGGAGTCTACTAGGTGGACGGAGACGGAGGATTTCACGACCCTAATGTCCA 



********************* 

601 GTGAGCCACCATGCCCAGCTCAGAATTTACTCTGTTTAGAAACATCTGGGTCTGAGGTAG 
CACTCGGTGGTACGGGTCGAGTCTTAAATGAGACAAATCTTTGTAGACCCAGACTCCATC 



FIGURE 21 (continued, 2 of 6) 

CCAAT 
***************> 

661 GAAGCTCACCCCACTCAAGTGTTGTGGTGTTTTAAGCCAATGATAGAATTTTTTTATTGT 
CTTCGAGTGGGGTGAGTTCACAACACCACAAAATTCGGTTACTATCTTAAAAAAATAACA 

721 TGTTAGAACACTCTTGATGTTTTACACTGTGATGACTAAGACATCATCAGCTTTTCAAAG 
ACAATCTTGTGAGAACTACAAAATGTGACACTACTGATTCTGTAGTAGTCGAAAAGTTTC 

CAP 

***************> 

781 ACACACTAACTGCACCCATAATACTGGGGTGTCTTCTGGGTATCAGCGATCTTCATTGAA 
TGTGTGATTGACGTGGGTATTATGACCCCACAGAAGACCCATAGTCGCTAGAAGTAACTT 

CAP 

************ 

841 TGCCGGGAGGCGTTTCCTCGCCATGCACATGGTGTTAATTACTCCAGCATAATCTTCTGC 
ACGGCCCTCCGCAAAGGAGCGGTACGTGTACCACAATTAATGAGGTCGTATTAGAAGACG 



***> 

901 TTCCATTTCTTCTCTTCCCTCTTTTAAAATTGTGTTTTCTATGTTGGCTTCTCTGCAGAG 
AAGGTAAAGAAGAGAAGGGAGAAAATTTTAACACAAAAGATACAACCGAAGAGACGTCTC 

CAP 

***************> 

961 AACC^GTGTAAGCTACAACTTAACTTTTGTTGGAAC^U\ATTTTCCAAACCGCCCCTTTGC 
TTGGTCACATTCGATGTTGAATTGAAAACAACCTTGTTTAAAAGGTTTGGCGGGGAAACG 

1021 C CTAGTGGCAGAGACAATTCACAAACACAG C C CTTTAAAAAGGCTTAGGGAT CACTAAGG 
GGATCACCGTCTCTGTTAAGTGTTTGTGTCGGGAAATTTTTCCGAATCCCTAGTGATTCC 

1081 GGATTTCTAGAAGAGCGACCCGTAATCCTTAAGTATTTACAAGACGAGGCTAACCTCCAG 
CCTAAAGATCTTCTCGCTGGGCATTAGGAATTCATAAATGTTCTGCTCCGATTGGAGGTC 

1141 CGAGCGTGACAGCCCAGGGAGGGTGCGAGGCCTGTTCAAATGCTAAGCTTCCATAAATAA 
GCTCGCACTGTCGGGTCCCTCCCACGCTCCGGACAAGTTTACGATTCGAAGGTATTTATT 

1201 AGCAAATTTCCTCCGGCAGTTTCTGGAAAGTAGGAAAGGTTAACATTTAAGGTTGCGTTT 
TCGTTTAAAGGAGGCCGTCAAAGACCTTTCATCCTTTCCAATTGTAAATTCCAACGCAAA 

1261 GTTAGCATTTCAGTGTTTGCCGACCTCAGCTAACAGCATCCCTGCAAGGCCTCGGGAGAC 
CAATCGTAAAGTCACAAACGGCTGGAGTCGATTGTCGTAGGGACGTTCCGGAGCCCTCTG 

1321 CCAGAAGTTTCTCGCCCCTTAGATCCAAACTTGAGCAACCCGGAGTCTGGATTCCTGGGA 
GGTCTTCAAAGAGCGGGGAATCTAGGTTTGAACTCGTTGGGCCTCAGACCTAAGGACCCT 

TopoII 
*****************> 

1381 AGTCCTCAGCTGTCCTGCGGTTGTGCCGGGGCCCCAGGTCTGGAGGGGACCAGTGGCCGT 
TCAGGAGTCGACAGGACGCCAACACGGCCCCGGGGTCCAGACCTCCCCTGGTCACCGGCA 

1441 GTGGCTTCTACTGCTGGGCTGGAAGTCGGGCCTCCTAGCTCTGCAGTCCGAGGCTTGGAG 
CACCGAAGATGACGACCCGACCTTCAGCCCGGAGGATCGAGACGTCAGGCTCCGAACCTC 



# 



FIGURE 21 (continued, 3 of 6) 

1501 CCAGGTGCCTGGACCCCGAGGCTGCCCTCCACCCTGTGCGGGCGGGATGTGACCAGATGT 
GGTCCACGGACCTGGGGCTCCGACGGGAGGTGGGACACGC CCGCC CTAC ACTGGTCTAC A 

1561 TGGCCTCATCTGCCAGACAGAGTGCCGGGGCCCAGGGTCAAGGCCGTTGTGGCTGGTGTG 
ACCGGAGTAGACGGTCTGTCTCACGGCCCCGGGTCCCAGTTCCGGCAACACCGACCACAC 

1621 AGGCGCCCGGTGCGCGGCCAGCAGGAGCGCCTGGCTCCATTTCCCACCCTTTCTCGACGG 
TCCGCGGGCCACGCGCCGGTCGTCCTCGCGGACCGAGGTAAAGGGTGGGAAAGAGCTGCC 

1681 GACCGCCCCGGTGGGTGATTAACAGATATTGGGGTGGTTTGCTCATGGTGGGGACCCCTT 
CTGGCGGGGCCACCCACTAATTGTCTATAACCCCACCAAACGAGTACCACCCCTGGGGAA 

1741 CGCCGCCTGAGAACCTGCAAAGAGAAATGACGGGCCTGTGTCAAGGAGCCCAAGTCGCGG 
GCGGCGGACTCTTGGACGTTTCTCTTTACTGCCCGGACACAGTTCCTCGGGTTCAGCGCC 

18 01 GGAAGTGTTGCAGGGAGGCACTCCGGGAGGTCCCGCGTGCCCGTCCAGGGAGCAATGCGT 
CCTTCACAACGTCCCTCCGTGAGGCCCTCCAGGGCGCACGGGCAGGTCCCTCGTTACGCA 

1861 CCTCGGGTTCGTCCCCAGCCGCGTCTACGCGCCTCCGTCCTCCCCTTCACGTCCGGCATT 
GGAGCCCAAGCAGGGGTCGGCGCAGATGCGCGGAGGCAGGAGGGGAAGTGCAGGCCGTAA 

1921 CGTGGTGCCCGGAGCCCGACGCCCCGCGTCCGGACCTGGAGGCAGCCCTGGGTCTCCGGA 
GCACCACGGGCCTCGGGCTGCGGGGCGCAGGCCTGGACCTCCGTCGGGACCCAGAGGCCT 

1981 TCAGGCCAGCGGCCAAAGGGTCGCCGCACGCACCTGTTCCCAGGGCCTCCACATCATGGC 
AGTCCGGTCGCCGGTTTCCCAGCGGCGTGCGTGGACAAGGGTCCCGGAGGTGTAGTACCG 

2041 CCCTCCCTCGGGTTACCCCACAGCCTAGGCCGATTCGACCTCTCTCCGCTGGGGCCCTCG 
GGGAGGGAGCCCAATGGGGTGTCGGATCCGGCTAAGCTGGAGAGAGGCGACCCCGGGAGC 

Spl 
******** 

2101 CTGGCGTCCCTGCACCCTGGGAGCGCGAGCGGCGCGCGGGCGGGGAAGCGCGGCCCAGAC 
GACCGCAGGGACGTGGGACCCTCGCGCTCGCCGCGCGCCCGCCCCTTCGCGCCGGGTCTG 

2161 CCCCGGGTCCGCCCGGAGCAGCTGCGCTGTCGGGGCCAGGCCGGGCTCCCAGTGGATTCG 
GGGGCCCAGGCGGGCCTCGTCGACGCGACAGCCCCGGTCCGGCCCGAGGGTCACCTAAGC 

2221 CGGGCAACAGACGCCCAGGACCGCGCTTCCCACGTGGCGGAGGGACTGGGGACCCGGGCA 
GCCCGTTGTCTGCGGGTCCTGGCGCGAAGGGTGCACCGCCTCCCTGACCCCTGGGCCCGT 

Spl 



E2F 
******** 

2281 CCGGTCCTGCCCCTTCACCTTCCAGCTCCGCCTCGTCCGCGCGGAACCCCGCCCCGTCCC 
GGCCAGGACGGGGAAGTGGAAGGTCGAGGCGGAGCAGGCGCGCCTTGGGGCGGGGCAGGG 

2341 GAACCCTTCCCGGGTCCCCGGCCCAGCCCCTTCCGGGCCATCCCAGCCCGTCCCGTTCCT 
CTTGGGAAGGGCCCAGGGGCCGGGTCGGGGAAGGCCCGGTAGGGTCGGGCAGGGCAAGGA 



FIGURE 21 (continued, 4 of 6) 

Spl 



E2F NFkB 
********* ********* ******************** 

2401 TTTCCGCGGCCCCGCCCTCTCCTCGCGGCGCGAGTTTCAGGCAGCGCTGCGTCCTGCTGC 

AAAGGCGCCGGGGCGGGAGAGGAGCGCCGCGCTCAAAGTCCGTCGCGACGCAGGACGACG 

hTRT5 1 

*****************************> 

2461 GCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCGATGCCGCGCGCTCCCCGCTGCCGAG 
CGTGCACCCTTCGGGACCGGGGCCGGTGGGGGCGCTACGGCGCGCGAGGGGCGACGGCTC 

2521 CCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTCGTGC 
GGCACGCGAGGGACGACGCGTCGGTGATGGCGCTCCACGACGGCGACCGGTGCAAGCACG 

E2F 
******* 

2581 GGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCG 
CCGCGGACCCCGGGGTCCCGACCGCCGACCACGTCGCGCCCCTGGGCCGCCGAAAGGCGC 



* 

2641 CGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGCCC 
GCGACCACCGGGTCACGGACCACACGCACGGGACCCTGCGTGCCGGCGGGGGGCGGCGGG 

NFkB 



********************************************** 
2701 CCTCCTTCCGCCAGGTGGGCCTCCCCGGGGTCGGCGTCCGGCTGGGGTTGAGGGCGGCCG 
GGAGGAAGGCGGTCCACCCGGAGGGGCCCCAGCCGCAGGCCGACCCCAACTCCCGCCGGC 

Topo_II_cleavag 



NFkB 
++++++++++ 
NFkB 



Intronl 

*********************************************************> 

2761 GGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTCAGGGCGCTTCCCCCGCAGGT 
CCCCCTTGGTCGCTGTACGCCTCTCGTCGCGTCCGCTGAGTCCCGCGAAGGGGGCGTCCA 

e site 



2821 GTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAA 
CAGGACGGACTTCCTCGACCACCGGGCTCACGACGTCTCCGACACGCTCGCGCCGCGCTT 

2881 GAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCGCGGGGGCCCCCCCGAGGC 
CTTGCACGACCGGAAGCCGAAGCGCGACGACCTGCCCCGGGCGCCCCCGGGGGGGCTCCG 



2 941 CTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAG 
GAAGTGGTGGTCGCACGCGTCGATGGACGGGTTGTGCCACTGGCTGCGTGACGCCCCCTC 



FIGURE 21 (continued, 5 of 6) 



3001 CGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGTGCTGGTTCACCTGCTGGC 
GCCCCGCACCCCCGACGACGACGCGGCGCACCCGCTGCTGCACGACCAAGTGGACGACCG 

3061 ACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCC 
TGCGACGCGCGAGAAACACGACCACCGAGGGTCGACGCGGATGGTCCACACGCCCGGCGG 

3121 GCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCCACACGCTAGTGGACCCCG 
CGACATGGTCGAGCCGCGACGGTGAGTCCGGGCCGGGGGCGGTGTGCGATCACCTGGGGC 

3181 AAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCT 
TTCCGCAGACCCTACGCTTGCCCGGACCTTGGTATCGCAGTCCCTCCGGCCCCAGGGGGA 

3241 GGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGCCAGCCGAAGTCTGCCGTT 
CCCGGACGGTCGGGGCCCACGCTCCTCCGCGCCCCCGTCACGGTCGGCTTCAGACGGCAA 

3301 GCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGG 
CGGGTTCTCCGGGTCCGCACCGCGACGGGGACTCGGCCTCGCCTGCGGGCAACCCGTCCC 

3361 GTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCGTGGTTTCTGTGTGGTGTC 
CAGGACCCGGGTGGGCCCGTCCTGCGCACCTGGCTCACTGGCACCAAAGACACACCACAG 

3421 ACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCA 
TGGACGGTCTGGGCGGCTTCTTCGGTGGAGAAACCTCCCACGCGAGAGACCGTGCGCGGT 

3481 CTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCCATCCACATCGCGGCCACC 
GAGGGTGGGTAGGCACCCGGCGGTCGTGGTGCGCCCGGGGGGTAGGTGTAGCGCCGGTGG 

3541 ACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTC 
TGCAGGGACCCTGTGCGGAACAGGGGGCCACATGCGGCTCTGGTTCGTGAAGGAGATGAG 

3601 CTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAGCTCTCTGAGGCCCAGCCT 
GAGTCCGCTGTTCCTCGTCGACGCCGGGAGGAAGGATGAGTCGAGAGACTCCGGGTCGGA 

3661 GACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGG 
CTGACCGCGAGCCTCCGAGCACCTCTGGTAGAAAGACCCAAGGTC CGGGACCTACGGTC C 

3721 GACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCAAATGCGGCCCCTGTTTCT 
CTGAGGGGCGTCCAACGGGGCGGACGGGGTCGCGATGACCGTTTACGCCGGGGACAAAGA 

3781 GGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCC 
CCTCGACGAACCCTTGGTGCGCGTCACGGGGATGCCCCACGAGGAGTTCTGCGTGACGGG 

3841 GCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCGGGAGAAGCCCCAGGGCTC 
CGACGCTCGACGCCAGTGGGGTCGTCGGCCACAGACACGGGCCCTCTTCGGGGTCCCGAG 

3901 TGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCA 
ACACCGCCGGGGGCTCCTCCTCCTGTGTCTGGGGGCAGCGGACCACGTCGACGAGGCGGT 

3961 GCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTGCCTGCGCCGGCTGGTGCC 
CGTGTCGTCGGGGACCGTCCACATGCCGAAGCACGCCCGGACGGACGCGGCCGACCACGG 

4021 CCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTT 
GGGTCCGGAGACCCCGAGGTCCGTGTTGCTTGCGGCGAAGGAGTCCTTGTGGTTCTTCAA 




FIGURE 21 (continued, 6 of 6) 

4081 CATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCTGACGTGGAAGATGAGCGT 
GTAGAGGGACCCCTTCGTACGGTTCGAGAGCGACGTCCTCGACTGCACCTTCTACTCGCA 



**************************** 

4141 GCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGTGAGGAGGTGGTGGCCGTCGAGGGCCC 
CGCCCTGACGCGAACCGACGCGTCCTCGGGTCCACTCCTCCACCACCGGCAGCTCCCGGG 

lntron2 

************************************************************ 

4201 AGGCCCCAGAGCTGAATGCAGTAGGGGCTCAGAAAAGGGGGCAGGCAGAGCCCTGGTCCT 
TCCGGGGTCTCGACTTACGTCATCCCCGAGTCTTTTCCCCCGTCCGTCTCGGGACCAGGA 

****************** ******************************* *********** 

4261 CCTGTCTCCATCGTCACGTGGGCACACGTGGCTTTTCGCTCAGGACGTCGAGTGGACACG 
GGACAGAGGTAGCAGTGCAC C CGTGTGCACCGAAAAGCGAGTCCTGCAGCTCACCTGTGC 



*****> 
4321 GTGATCGAGGTCGAC 
CACTAGCTCCAGCTG 



Figure 22 





# 



FIGURE 23 

EST AA281296 
(Seq. ID. No. 8) 



gc 

caagttcctg 
tgtcacggag 
caagttgcaa 
ggaagcagag 
cttcatcccc 
cagaacgttc 
cagcgtgctc 



cactggctga 
accacgtttc 
agcattggaa 
gtcaggcagc 
aagcctgacg 
cgcagagaaa 
aactacgagc 



tgagtgtgta 
aaaagaacag 
tcagacagca 
atcgggaagc 
ggctgcggcc 
agagggccga 
gggcgcg 



cgtcgtcgag 
gctctttttc 
cttgaagagg 
caggcccgcc 
gattgtgaac 
gcgtctcacc 



ctgctcaggt 
taccggaaga 
gtgcagctgc 
ctgctgacgt 
atggactacg 
tcgagggtga 



ctttctttta 
gtgtctggag 
gggacgtgtc 
ccagactccg 
tcgtgggagc 
aggcactgtt 



111 HII 'I II Ill 



FIGURE 24- 
(Seq.JD. No. 9) 

TCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAG 

ACCAGCCCGCTGAGGGATGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGC 

CAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCAT 
CAGGGGCAAGTC 



# 
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PANEL A 



panel b 





1 2 3X 5 6 7 fl 




Figure 28 



NXT Heparin Affinity Qlyc.grad. 




2 3 1 5 6 7 8? 1011I2I3 



100% 62% 132% 60% 
rakitlvo octlvtty/pmoJ taJonwats 
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Figure 31 - 
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1-3 



i-i 7-9 10-12 13-15 1A-1S 19-21 22-2* 25-27 2*-30 



OOOOOOOOOO 
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teiomerase 



template 
CCAAAACCCCAAAAC 

. T l 1 



5" 



teiomerase 



template 



CCAAAACCCCAAAAC 



• CCI T t T-y 




PANEL A 



PANEL B 
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Figure 33 




25-27 28-30 



# 



Figure 34 

! CCCCAAAACC ccaaaacccc aaaaccccta taaaaaaaga aaaaattgag 
5 1 gtagtttag a aataaaatat tattcccgca caaatggaga tggatattga 
101 tttggatgat atagaaaatt tacttcctaa tacattcaac aagtatagca 
1 5 1 gctcttgtag tgacaagaaa ggatgcaaaa cattgaaatc tggctcgaaa 
201 tcgccttcat tg actattcc aaagttgcaa aaacaattag agttctactt 
25 1 ctcggatgca aatctttata acgattcttt cttgagaaaa ttagttttaa 

301 AAAGCGGAGA gcaaagagta GAAATTGAAA CATTACTAAT GTTTAAATAA 
3 5 1 AATC AGGTA A TG AGG ATTAT TCTATTTTTT AG ATCACTTC TTAAGGAGCA 
401 TTATGGAGAA AATTACTTAA TACTAAAAGG TAAACAGTTT GGATTATTTC 
45 1 CCTAGCCAAC AATGATGAGT ATATTAAATT CATATGAGAA TGAGTCAAAG 
50 i GATCTCGATA CATCAGACTT ACC AAAG ACA AACTCGCTAT AAAACGCAAG 
551 AAAAAGTTTG ATAATCGAAC AGCAGAAGAA CTTATTGCAT TTACTATTCG 
601 TATGGGTTTT ATTACAATTG TTTTAGGTAT CGACGGTGAA CTCCCGAGTC 
65 1 TTGAGAC AAT TG AAAAAGCT GTTTACAACT GAAGGAATCG CAGTTCTGAA 
701 AGTTCTGATG TGTATGCCAT TATTTTGTGA ATTAATCTCA AATATCTTAT 
751 CTCAATTTAA TGGATAGCTA TAGAAACAAA CCAAATAAAC CATGCAAGTT 
SOI TAATGGAATA TACGTTAAAT CCTTTGGGAC AAATGCACAC TGAATTTATA 
85 i TTGGATTCTT AAAGCATAG A TACACAGAAT GCTTTAGAGA CTGATTTAGC 
90 1 TTACAACAGA TTACCTGTTT TG ATTACTCT TGCTCATCTC TTATATCTTT 
951 AAAAGAAGCA GGCGAAATGA AAAGAAGACT AAAGAAAGAG ATTTCAAAAT 
1 00 1 TTGTTGATTC TTCTGTAACC GGAATTAACA ACAAGAATAT TAGCAACGAA 
1 05 1 AAAGAAG AAG AGCTATC ACA ATCCTGATTC TTAAAGATTT C AAAAATTCC 
1101 AGGTAAGAGA GATACATTCA TTAAA ATTC A TATATTATAG TTTTTCATTT 
! I 5 1 CACAGCTGTT ATTTTCTTTT ATCTTAACAA TATTTTTTGA TTAGCTGGAA 
1 20 1 GTAAAAAGTA TCAAATAAGA GAAGCGCTAG ACTGAGGTAA CTTAGCTTAT 
1251 TC AC ATTC AT AGATCGACCT TCATATATCC AATACGATGA TAAGGAAACA 
1301 GCAGTCATCC GTTTTAAAAA TAGTGCTATG AGGACTAAAT TTTTAGAGTC 
13 51 AAGAAATGG A GCCG AAATCT TAATCAAAAA G AATTGCGTC GATATTGCAA 
1401 AAGAATCGAA CTCTAAATCTTTCGTTAATA AGTATTACCA ATCTTGATTG 
1451 ATTGAAGAGA TTGACGAGGC AACTGCACAG AAGATCATTA AAGAAATAAA 
1501 GTAACTTTTA TTAATTAGAG AATAAACTAA attactaata tagagatcag 
1551 CGATCTTCAA ttgacgaaat aaaagctgaa ctaaagttag acaataaaaa 

1 60 1 ATACAA ACCT TGGTC AAAAT ATTG AGGA AG G AAAAGAAG A CCAGTTAGCA 

165 i AAAGAAAAAA TAAGGCAATA AATAAAATGA GTACAGAAGT GAAGAAATAA 

I 701 AAGATTTATT I I 1 1 1CAATA ATTTATTGAA AAGAGGGGTT TTGGGGTTTT 

1751 GGGGTTTTGG GG 



• 



CCCCAAAACCCCAAAACCCCAAAACCCCTATAAAAAAAGAAAAAATTGAGGTAGTTTAGA 
1 + + + + + + 6Q 

GGGGTTTTGGGGTTTTGGGGTTTTGGGGATATTTTTTTCTTTTTTAACTCCATCAAATCT 

a PQNPKTPKPL* KKKKLR* FR 

b PKTPKPQNPYKKRKN*GSLtE - 

c PKPQNPKTPIKKEKIEVV * K- 

AATAAAATATTATTCCCGCACAAATGGAGATGGATATTGATTTGGATGATATAGAAAATT 
61 + + + + + + 12Q 

TTATTTTATAATAAGGGCGTGTTTACCTCTACCTATAACTAAACCTACTATATCTTTTAA 

a NKILFPHKWRWILIWMI*KI 

b IKYYSRTNGDGY*FG*YRKF- 

c * N I I PAQMEMDXDLDDI ENL- 

TACTTCCTAATACATTCAACAAGTATAGCAGCTCTTGTAGTGACAAGAAAGGATGCAAAA 

121 + + + + + + 180 

ATGAAGGATTATGTAAGTTGTTCATATCGTCGAGAACATCACTGTTCTTTCCTACGTTTT 

a YFLIHSTSIAALVVTRKDAK 

b TS*YIQQV*QtiL*CQERMQN- 

c LPNTFNKYSSSCSDKKGCKT- 

CATTGAAATCTGGCTCGAAATCGCCTTCATTGACTATTCCAAAGTTGCAAAAACAATTAG 

181 + + + + + + 240 

GTAACTTTAGACCGAGCTTTAGCGGAAGTAACTGATAAGGTTTCAACGTTTTTGTTAATC 

a H *NLARNRLH * LFQSCKNN* 

b IEIWLEIAFIDYSKVAKTIR- 

C LKSGSKSPSLTIPKLQKQLE- 

AGTTCTACTTCTCGGATGCAAATCTTTATAACGATTCTTTCTTGAGAAAATTAGTTTTAA 

241 + + + + + + 300 

TCAAGATGAAGAGCCTACGTTTAGAAATATTGCTAAGAAAGAACTOTTTTAATCAAAATT 

a SSTSRMQIFITILS*EN*F* 

b VLLLGCKSL*RFFLEKISFK- 

c FYFSDANLYNDSFLRKLVLK- 

AAkGCGGAGAGCAAAGAGTAGAAATTGAAACATTACTAATGTTTAAATAAAATCAGGTAA 
301 + + + + + + 360 

TTTCGCCTCTCGTTTCTCATCTTTAACTTTGTAATGATTACAAATTTATTTTAGTCCATT 

a KAESKE*KLKHY*CLNKIR* 

b KRRAKSRN*NITNV*IKSGN- 

c SGEQRVEI ETLLMFK *NQVM- 

TGAGGATTATTCTATTTTTTAGATCACTTCTTAAGGAGCATTATGGAGAAAATTACTTAA 
361 + + + + + + 420 

ACTCCTAATAAGATAAAAAATCTAGTGAAGAATTCCTCGTAATACCTCTTTTAATGAATT 

a *GLFYFLDHFLRSIMEKIT* 

b EDYSIF* ITS*GALWRKLLN- 

c RI I LF FRS LLK EHYGENYL I - 



FIG. 35 



TACTAAAAGGTAAACAGTTTGGATTATTTCCCTAGCCAACAATGATGAGTATATTAAATT 
421 + + + + + + 430 

ATGATTTTCCATTTGTCAAACCTAATAAAGGGATCGGTTGTTACTACTCATATAATTTAA 

a Y*KVNSL.DYFPSQQ**VY*I 

b TKR*TVWI ISLANNDEYIKF- 

c LKGKQFGLFP * PTMMS I LNS- 



CATATGAGAATGAGTCAAAGGATCTCGATACATCAGACTTACCAAAGACAAACTCGCTAT 
+ h h + + ^ 

GTATACTCTTACTCAGTTTCCTAGAGCTATGTAGTCTGAATGGTTTCTGTTTGAGCGATA 



a HMRMSQRISIHQTYQRQTRY 

b I *E*VKGSRYIRIiTKDKLAI 

C YENESKDIjDTSDLPKTNSL * - 



AAAACGCAAGAAAAAGTTTGATAATCGAACAGCAGAAGAACTTATTGCATTTACTATTCG 

I- + H (- H + 

TTTTGCGTTCTTTTTCAAACTATTAGCTTGTCGTCTTCTTGAATAACGTAAATGATAAGC 



a KTQEKV* *SNSRRTYCIYYS 

b KRKKKFDNRTAEELIAFTIR- 

c NARKSLI IEQQKHLLHLLFV- 

TATGGGTTTTATTACAATTGTTTTAGGTATCGACGGTGAACTCCCGAGTCTTGAGACAAT 
601 + + + + + + 660 

ATACCCAAAATAATGTTAACMAATCCATAGCTGCCACTTGAGGGCTCAGAACTCTGTTA 

a YGFYYNCFRYRR*TPES * D N 

b MGFITIVLGIDGELPSLETI- 

c WVLLQLF *VSTVNSRVLRQL- 

TGAAAAAGCTGTTTACAACTGAAGGAATCGCAGTTCTGAAAGTTCTGATGTGTATGCCAT 
661 + + + + + + 720 

ACTTTTTCGACAAATGTTGACTTCCTTAGCGTCAAGACTTTCAAGACTACACATACGGTA 

a *KSCLQLKESQF*KFCCVCH 

b EKAVYN * R N R S S ESSDVYAI - 

c KKLFTTEGIAVLKVLMCMPL- 

TATTTTOTGAATTAATCTCAAATATCTTATCTCAATTTAATGGATAGCTATAGAAACAAA 
721 + + + + + + 780 

ATAAAACACTTAATTAGAGTTTATAGAATAGAGTTAAATTACCTATCGATATCTTTGTTT 

a YFVN* SQISYLNLMDSYRNK 

b IL*INLKYLISI*WIAIETN- 

c FCELI SNILSQFNG* I# * KQT- 

CCAAATAAACCATGCAAGTTTAATGGAATATACGTTAAATCCTTTGGGACAAATGCACAC 

781 + + + -f- + + 840 

GGTTTATTTGGTACGTTCAAATTACCTTATATGCAATTTAGGAAACCCTGTTTACGTGTG 

a PNKPCKFNGIYVKSFGTNAH 

b QINHASLMEYTLNPLGQMHT- 

c K * TMQV*WNIR * ILWDKCTL- 

TGAATTTATATTGGATTCTTAAAGCATAGATACACAGAATGCTTTAGAGACTGATTTAGC 
841 + + + + + + 900 

ACTTAAATATAACCTAAGAATTTCGTATCTATGTGTCTTACGAAATCTCTGACTAAATCG 

a * IYIGFLKHRYTECFRD* FS 

b EFILDS *SIDTQNALETDLA- 

C NLYWILKA* IHRML * R I* I * L - 



F/G. 35 

(CONTINUED) 



TTACAACAGATTACCTGTTTTGATTACTCTTGCTCATCTCTTATATCTTTAAAAGAAGCA 

+ H + + + 4- 

AATGTTGTCTAATGGACAAAACTAATGAGAACGAGTAGAGAATATAGAAATTTTCTTCGT 



a LQQITCFDYSCSSLISLKEA 

b YNRLPVLITLAHLLYL * K K Q - 

C TTDYLF * LLLLISYIFKRSR- 



GGCGAAATGAAAAGAAGACTAAAGAAAGAGATTTCAAAATTTGTTGATTCTTCTGTAACC 

H »- H + + h 

CCGCTTTACTTTTCTTCTGATTTCTTTCTCTAAAGTTTTAAACAACTAAGAAGACATTGG 



a GEMKRRLKKEI SKFVDSSVT 

b AK*KED*RKRFQNLLILL * P 

C RNEKKTKERDFKIC * FFCNR- 

GG AATTAACAACAAGAATATTAGCAACGAAAAAGAAGAAGAGCTATCACAATCCTGATTC 
1021 + + + + + + 1080 

CCTTAATTGTTGTTCTTATAATCGTTGCTTTTTCTTCTTCTCGATAGTGTTAGGACTAAG 

a GINNKNISNEKEEELSQS * F 

b ELTTRILATKKKKSYHNPDS- 

c N * QQEY *QRKRRRAITI LI L- 

TTAAAGATTTCAAAAATTCCAGGTAAGAGAGATACATTCATTAAAATTCATATATTATAG 
1081 + + + + + + 1140 

AATTTCTAAAGTTTTTAAGGTCCATTCTCTCTATGTAAGTAATTTTAAGTATATAATATC 

a LKISKIPGKRDTFIKIHIL* 

b * RFQKFQVREIHSLKFIYYS 

c KDFKNSR* ERYIH*NSYI IV- 

TTTTTCATTTCACAGCTGTTATTTTCTTTTATCTTAACAATATTTTTTGATTAGCTGGAA 

1141 + + + + + + 1200 

AAAAAGTAAAGTGTCGACAATAAAAGAAAATAGAATTGTTATAAAAAACTAATCGACCTT 

a FFISQLLFSFILTIFFD* LE 

b FSFHSCYFLLS * Q Y F L I SWK- 

C FHFTAVIFFYLNNIF * LAGS- 

GTAAAAAGTATCAAATAAGAGAAGCGCTAGACTGAGGTAACTTAGCTTATTCACATTCAT 
1201 + + + + + + 1260 

CATTTTTCATAGTTTATTCTCTTCGCGATCTGACTCCATTGAATCGAATAAGTGTAAGTA 

a VKSIK*EKR *TEVT*LIHIH 

b *KVSNKRSARLR*LSLFTFI- 

C KKYQIREAIjD*GNLAYSHS * — 

AGATCGACCTTCATATATCCAATACGATGATAAGGAAACAGCAGTCATCCGTTTTAAAAA 
1261 + + + + + + 1320 

TCTAGCTGGAAGTATATAGGTTATGCTACTATTCCTTTGTCGTCAGTAGGCAAAATTTTT 

a RSTFIYPIR**GNSSHPF*K 

b DRPSYIQYDDKETAVIRFKN- 

c IDLHI SNTMIRKQQSSVLKI- 



TAGTGCTATGAGGACTAAATTTTTAGAGTCAAGAAATGGAGCCGAAATCTTAATCAAAAA 
+ + + + + + 

ATCACGATACTCCTGATTTAAAAATCTCAGTTCTTTACCTCGGCTTTAGAATTAGTTTTT 



a * CYED * IFRVKKWSRNLNQK 
b SAMRTKFLESRNGAEILIKK 
C VL*GLNF*SQEMEPKS* SKR 
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(CONTINUED) 



GAATTGCGTCGATATTGCAAAAGAATCGAACTCTAAATCTTTCGTTAATAAGTATTACCA 
1381 + + + + + + 144Q 

CTTAACGCAGCTATAACGTTTTCTTAGCTTGAGATTTAGAAAGCAATTATTCATAATGGT 

a ELRRYCKRIEL*IFR**VLP 

b NCVDIAKESNSKSFVNKYYQ- 

c IASILQKNRTLNLSLISITN- 



ATCTTGATTGATTGAAGAGATTGACGAGGCAACTGCACAGAAGATCATTAAAGAAATAAA 

+ ■+ i H H K 

TAGAACTAACTAACTTCTCTAACTGCTCCGTTGACGTGTCTTCTAGTAATTTCTTTATTT 



a ILIDCRDCRGNCT£DH*RNK 
b S * lilEElDEATAQKI I K E IK 
c LD*LKRLTRQLHRRSIiKK*S 



GTAACTTTTATTAATTAGAGAATAAACTAAATTACTAATATAGAGATCAGCGATCTTCAA 

" H — - — — — ( 1 h — ». 

CATTGAAAATAATTAATCTCTTATTTGATTTAATGATTATATCTCTAGTCGCTAGAAGTT 



, * VTFIN*RIN*ITNIEISDLQ 
=|b *IjLLIRE*TKLLI*RSAIFN- 
;|G NFY*LENKLNY*YRDQRSSI- 

TTGACGAAATAAAAGC TGAACTAAAGTTAGACAAT AAAAAATAC AAACC TTG GTCAAAAT 
y 1561 + + + + + + 162 o 

" AACTGCTTTATTTTCGACTTGATTTCAATCTGTTATTTTTTATGTTTGGAACCAGTTTTA 

» a LTK*KLN*S*TIKNTNLGQN 
; r ;b *RNKS*TKVRQ*KIQTIiVKI- 
!1 <= DE I KAEXiKLiDNKKYK PW S K Y - 

^ ATTGAGGAAGGAAAAGAAGACCAGTTAGCAAAAGAAAAAATAAGGCAATAAATAAAATGA 
' 1621 + + -»- + + + 1680 

TAACTCCTTCCTTTTCTTCTGGTCAATCGTTTTCTTTTTTATTCCGTTATTTATTTTACT 

IEEGKEDQLAKEKXRQ* IK* 
« b LRKEKKTS *QKKK*GNK * N E - 
Z c *GRKRRPVSKRKNKAINKMS- 

GTACAGAAGTGAAGAAATAAAAGATTTATTTTTTTCAATAATTTATTGAAAAGAGGGGTT 
1681 + + + + + -f- 1740 

CATGTCTTCACTTCTTTATTTTCTAAATAAAAAAAGTTATTAAATAACTTTTCTCCCCAA 

a VQK*RNKRFIFFNNLLKRGV 

b YRSEEIKDLFFSIIY*KEGF- 

C TEVKK*KIYFFQ*FIEKRGF- 

TTGGGGTTTTGGGGTTTTGGGG 

1741 + 1762 

AACCCCAAAACCCCAAAACCCC 

a LGFWGFG 
b WGFGVLG 
c GVLGFW 
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1 aactcatna attactaart taatcaacaa gattgataaa aagcagtaaa taaaacccaa 
6 1 tagatttaat ttagaaagta tcaattgaaa aatggaaatt gaaaacaact aagcacaata 
121 gccaaaagcc gaaaaattgt ggtgggaact tgaattagag atgcaagaaa accaaaatga 
1 8 1 tatataagtt agggttaaga ttgacgatcc taagcaatat ctcgtgaacg tcactgcagc 
24 1 atgtrtgtTg taggaaggta gttactacta agataaagat gaaagaagat atatcatcac 
301 taaagcacn cngaggtgg ctgagtctga tcctgagttc atctgctagt tggcagtcta 
361 catccgtaat gaactttaca tcagaactac cactaactac attgtagcat tttgtgttgt 
42 i ccacaagaat actcaaccat tcatcgaaaa gtacttcaac aaagcagtac ttttgcctaa 
481 tgacnactg gaagtctgtg aatngcata ggttctctat atttttgatg caactgaatt 
54 1 caaaaatng tatcttgata ggatacrttc ataagatatt cgtaaggaac tcactttccg 
60 1 taagtgma caaagatgcg tcagaagcaa gttttctgaa ttcaacgaat actaacttgg 
66 1 taagtangc actgaatcct aacgtaagaa aacaatgttc cgttacctct cagttaccaa 
72 1 caagtaaaag tgggattaaa ctaagaagaa gagaaaagag aatctcttaa ccaaacttta 
781 ggcaataaag gaatctgaag ataagtccaa gagagaaact ggagacataa tgaacgrtga 
84 1 agatgcaatc aaggctrtaa aaccagcagt tatgaagaaa atagccaaga gatagaatgc 
901 catgaagaaa cacatgaagg cacctaaaat tcctaactct accttggaat caaagtacn 
96 1 gaccttcaag gatctcatta agttctgcca tatttctgag cctaaagaaa gagtctataa 
1 02 1 gatccttggt aaaaaatacc ctaagaccga agaggaatac aaagcagcct ttggtgattc 
1081 tgcatctgca cccttcaatc ctgaattggc tggaaagcgt atgaagattg aaatctctaa 
1 14! aacatgggaa aatgaactca gtgcaaaagg caacactgct gaggtttggg ataatttaat 
1201 ttcaagcaat taactcccat atatggccat gttacgtaac ttgtctaaca tcttaaaagc 
1261 cggtgtttca gatactacac actctattgt gatcaacaag atttgtgagc ccaaggccgt 
1321 tgagaactcc aagatgttcc ctcttcaatt ctttagtgcc attgaagctg ttaatgaagc 
1381 agnactaag ggattcaagg ccaagaagag agaaaatatg aatcttaaag gtcaaatcga 
1441 agcagtaaag gaagttgttg aaaaaaccga tgaagagaag aaagatatgg agttggagta 
1501 aaccgaagaa ggagaamg ttaaagtcaa cgaaggaatt ggcaagcaat acattaactc 
1561 cangaacrt gcaatcaaga tagcagrtaa caagaattta gatgaaatca aaggacacac 
1621 tgcaatcttc tctgatgm ctggnctat gagtacctca atgtcaggtg gagccaagaa 
1681 gtatggttcc gttcgtactt gtctcgagtg tgcattagtc cttggtttga tggtaaaata 
1 74 1 acgngtgaa aagtcctcat tctacatcn cagncacct agnctcaat gcaataagtg 
1 80 1 ttacnagaa gngatctcc ccggagacga actccgtcct tctatgtaaa aactttrgca 
1 86 1 agagaaagga aaacnggtg gtggtactga mcccctat gagtgcartg atgaatggac 
1921 aaagaataaa actcacgtag acaatatcgt tattttgtct gatatgatga ngcagaagg 
1981 atancagat atcaatgtta gaggcagnc cattgnaac agcatcaaaa agtacaagga 
204 1 tgaagtaaat cctaacatta aaatcmgc agttgactta gaaggnacg gaaagtgcct 
2101 taatctaggt gatgagttca atgaaaacaa ctacatcaag aiancggta tgagcgattc 
2161 aatcttaaag rtcatttcag ccaagcaagg aggagcaaat atggtcgaag ttatcaaaaa 
222! ctngcccn caaaaaatag gacaaaagtg agtttcttga gattcrtcta taacaaaaat 
2281 ctcaccccac ttttrtgttt tangcatag ccanatgaa amaaana ttatctattt 
234 1 atttaagna cnacatagt ttatgtatcg cagtctana gcctancaa atganctgc 
2401 aaagaacaaa aaagattaaa a 
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MEIENNQAQQPKAEKLWWELELEMQENQNDIQVRVKIDDPKQYL 

WVTAACLLQEGSYYQDKX»EPJlYIITKALLEVAESDPEFICQLAVYIR>fELYIRTTTN 

YIVAFCWHKNTQPFIEKYFNKAVLLPNDLLEVCEFAQVLYIFDATEFKNLYLDRILS 

QDIRKELTFRXCLQRCVRSKFSEFNEYQLGKYCTESQRKKTMPRYLSVTNKQKWDQTK 

KJ<JIK£NLLTKJLQAIK£SEDKSKJICT 

APKIPNSTLESKYLTFKDLIKFCH1SEPKERVYKILGKKYPKTEEEYKAAFGDSASAP 

FNPELAGKRMKJEISKT\VENELSAKGNTAEVWDNLISSNQLPYMAMLRNLSNTLKAGV 
SDTTHSrVTNKICEPKAVENSKJvlFPLQFFSAlEAVNEAVTK^ 
O AVKXN^KTDEEKJCDMELEQTEEGEFVKVNEGIGKQYINSIELA^ 
H HTAJFSDVSGSMSTSMSGGAKXYGSVRTCLECALVLGLMVKQRCEKSSFYIFSSPSSQ 
Jt CNKCYLEVDLPGDELRPSMQKJLLQEKGKLGGGTDFPYECIDEWTKNKTHVDNIVILSD 
::|J N4MIAEGYSDn^RGSSIVNSIKKYlCDEVNPNIKJFAVDLEGYGKCLNLGDEFNEN^ 
L KIFGMSDSILKFISAKQGGANMVEVIKNFALQKIGQK 



Figure 44 

1 tcaatactat taattaataa aiaaaaaaaa gcaaactaca aagaaaatgt caaggcgtaa 
6 1 ctaaaaaaag ccataggctc ctataggcaa tgaaacaaat cttgattttg tattacaaaa 
1 2 1 tctagaagtt tacaaaagcc agattgagca ttataagacc tagtagtaat agatcaaaga 
1 8 1 ggaggatctc aagcttttaa agttcaaaaa ttaagattag gatggaaact ctggcaacga 
24 1 tgatgatgat gaagaaaaca actcaaataa ataataagaa ttattaagga gagtcaatta 
301 gattaagtag caagtttaat tgataaaaaa agttggttct aaggtagaga aagatttgaa 
361 tttgaacgaa gatgaaaaca aaaagaatgg acrttctgaa tagcaagtga aagaagagta 
421 attaagaacg attactgaag aataggnaa gtattaaaat tlagtattta acatggacta 
.481 ccagttagat ttaaatgaga gtggtggcca tagaagacac agaagagaaa cagattatga 
54 1 tactgaaaaa tggtttgaaa tatctcatga ccaaaaaaat tatgtatcaa tttacgccaa 
60 1 ctaaaagaca tcatattgtt ggtggcttaa agattatttt aataaaaaca attatgatca 
66 1 tcttaatgta agcattaaca gactagaaac tgaagccgaa ttctatgcct ttgatgattt 
721 ttcacaaaca atcaaacrta ctaataanc ttactagact. gttaacatag acgttaattt 
781 tgataaiaat ctctgtatac tcgcangct tagattttta rtatcactag aaagattcaa 
84 1 tatrrtgaat ataagatctt cttatacaag aaattaatat aattttgaga aaattggtga 
90 1 gctacrtgaa actatcttcg cagttgtctt ttctcatcgc cacttacaag gcattcattt 
96 1 acaagrtcct tgcgaagcgt tctaatatxt agttaactcc tcatcataaa ttagcgttaa 
1 02 1 agatagctaa ttataggtat actcmctc tacagacna aaattagttg acactaacaa 
1081 agtccaagat tattttaagt tcrtataaga attccctcgt ttgactcatg taagctagta 
1141 ggctatccca gttagtgcta ctaacgctgt agagaacctc aatgttttac ttaaaaaggt 
1 20 1 caagcatgct aatcttaatt tagrnctat ccctacctaa ttcaattttg atttctactt 
1 26 1 tgttaattta taacatnga aatiagagtt tggattagaa ccaaatattt tgacaaaaca 
1321 aaagcngaa aatctacttt tgagtataaa ataatcaaaa aatcttaaat ttttaagatt 
1381 aaacttnac acctacgng cttaagaaac ctccagaaaa cagatattaa aacaagctac 
144 1 aacaatcaaa aatctcaaaa acaataaaaa tcaagaagaa actcctgaaa ctaaagatga 
] 50 1 aactccaagc gaaagcacaa gtggtatgaa attttttgat catcrttctg aattaaccga 
1561 gcctgaagat ncagcgtta acngtaagc tacccaagaa acttatgata gcttgcacaa 
1 62 1 acrmgatl agatcaacaa arttaaagaa gttcaaatta agttacaaat atgaaatgga 
1681 aaagagtaaa atggatacat tcatagatct taagaatatt tatgaaacct taaacaatct 
1 74 1 taaaagatgc tctgttaata tatcaaatcc tcatggaaac atttcttatg aactgacaaa 
1801 taaagattct actrtnata aarrtaagct gaccnaaac taagaanat aacacgctaa 
1861 gtatacrtn aagtagaacg aatrnaart taataacgtt aaaagtgcaa aaattgaatc 
1 92 1 nccfcarta gaaagcrtag aagatanga lagtcrngc aaatctang cncngtaa 
1981 aaamacaa aatgttaata natcgccag tngctctat cccaacaata tnagaaaaa 
204 1 tccmcaat aagcccaatc nctatnn caagcaam gaataattga aaaatngga 
2101 aaatgtatct atcaactgta ttcngatca gcatatactt aattctattt cagaattctt 
2161 agaaaagaat aaaaaaataa aagcattcat tttgaaaaga tattatttat tacaatatta 
222 1 tcttganat actaaattat ttaaaacact tcaatagna cctgaattaa attaagttta 
228 1 canaanag caattagaag aangaccgt gagtgaagta cataagtaag tatgggaaaa 
234 1 ccacaagcaa aaagctttct atgaaccan atgtgagttt atcaaagaat catcctaaac 
240 1 cctttagcta atagattng accaaaacac tgtaagtgat gactctatta aaaagatttt 
246 1 agaatctata tctgagtcta agtatcatca ttatttgaga ngaacccta gttaatctag 
2521 cagtttaatt aaatctgaaa acgaagaaat ttaagaactt ctcaaagctt gcgacgaaaa 
2581 aggtgtma gtaaaagcat actataaan ccctctatgt ttaccaactg gtacttatta 
264 1 cgattacaat tcagatagat ggtgaaaat taaatanag rnaaataaa tattaaatat 
270 1 tgaatatnc tngcttan amgaataa tacatacaat agtcattm agtgtmga 
2761 atatartna gnatxtaat tcanarm aagtaaataa nattmca atcattttrt 
282! aaaaaatcg 



Figure 45 

MSRRNQKKPQAPIGNETNLDFVLQNLEVYKSQIEHYKTQQQQIK 

EEDLKLLKFKNQDQDGNSCNDDDDEENNSNKQQELLRRVNQIKQQVQLIKKVGSKVEK 

DLNLNEDENKKNGLSEQQVKXEQLRTTTEEQVKYQNLVFNMDYQLDLNESGGHRRHRR 

ETDYDTEKWEISHDQIWYVS!YANQKTSYCWWLKDYFNK>n^DHLhrVSINRLETEAE 

FYAFDDFSQTIKXTWSYQTVNIDVNFDNNLCILALLRFLLSLERFMLNmSSYTRW 

QYNFEKJGELLET1FAVVFSHRHLQGIHLQVPCEAFQYLVNSSSQISVKDSQLQVYSF 

STOLKJLVDTNKVQDYFKJLQEFPRl/ITrVSQQAIPV^ 

VSIPTQFNFDFYFVNLQHLKLEFGLEPNILTKQKLENLLLSIKQSKNLKFLRLNFYTY 

VAQETSRXQILKQATTIK>iLK>WKNQEETPETKDETPSESTSGMKJFDHLSELTELED 
I;;! FSVNLQATQEIYDSLHKLLIRSTNLKJ<JKLSYKYEMEKSKMDTFIDLKNrrETLNNLK 
tt RCSVNISNPHGNISYELTNKBSTFYKJKXTLNQELQHAKYTFKQNEFQFJWVKSAKIE 

SSSLESLEDIDSLCKSIASCKNLQNVNIIASLLYPNNIQKNPFNKPNLLFFKQFEQLK 
l A NLENVSrNCILDQHlLNSISEFLEKNKJCIKAFILKRYYLLQYYLDYTKLFKTLQQLPE 
H= LNQVYFNOQLEELTVSEVHKQVWENHKQKAFYEPLCEFIKESSQTLQLIDFDQNTVSD 
n DSIKJCJLESISESKYHHYLRLNPSQSSSL1KSENEEIQELLKACDEKGVLVKAYYKFP 

LCLPTGTYYDYNSDRW 



Figure 46 

MKILFEFIQDKLDIDLQTNSTYKENLKCGHFNGLDEILTTCFAL 
PNSEIKIALPCLPGDLSHKAVIDHCIIYLLTGELYNNVLTFGYKJARNEDVNNSLFCHS 

anvnvtllkgaawkmfhslvgtyafvdllinytviqfngqfftqivgnrcnephlppk 

WVQRSSSSSATAAQIKQLTEPVTNKQFLHKLNINSSSFFPYSKILPSSSSIKKLTDLR 
EAIFPTNLVKIPQRJLKVRJNLTLQKLLKJUIKJILNYVSILNSICPPLEGTVLDLSHLSR 
QSPKERVLKPIIVlLQKLLPQEMFGSKJCNKGKJIKNLNLLLSLPLNGYLPFDSLLKjCL 
RJLKDFRV^FISDIWTKHNFENLNQLAICFISWLFRQLIPKIIQTFFYCTEISSTVTI 

VYFRf©TWNKLITPFIVEYFKTYLVENNVC 

IIAIPCRGADEEEFTIYKJENHKNAIQPTQKJLEYLR^KRPTSFTKIYSPTQIADRJKE 
H FKQPXLKJ<JT^LPELYFMKJDVKSCYDSIPRMECMRILKDALK>JENGFFVRSQYFFN 
\ l f TNTGVLKXFhAA^ASRVPKJ'YELYIDNfVRTVHLSNQDVrNVVEMEIFKTALWVEDKCY 
i| [REDGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPSQDTLILKLADDFLIISTDQQQV 
f rNIKJO-AMGGFQKYNAKANRDKJLAVSSQSDDDT^IQFCAMmFVKELEVWKHSSTMN 
11 NFHIRSKSSKGIFRSHALFNTRJSYKTIDTNLNSTNTVLMQIDMVVKNISECYKSAF 
IS KDLSINVTQNMQFHSFLQRliEMTVSGCPITKCDPLlEYEVRFTILNGFLESLSSNTS 
O KFKJDNItLLRKEIOHLQAYIYiYIHIVN 



• 



Figure 47 



Oxytncha LCVSYiLSSFYY ANLEENALQFLRKESMDPEKPETNLLMRLT 
Euplotes LCVSSiLSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLT 



Figure 48 



ATTTATACT CATG AAAATCTTATT CG AGTTCATT CAAG ACAAG CTTG ACATTG ATCT ACA 

GACCAACAGTACTTACAAAGAAAATTTAAAATGTGGTCACTTCAATGGCCTCGATGAAAT 

TCTAACTACGTGTTTCGCACTACCAAATTCAAGAAAAATAGCATTACCATGCCTTCCTGG 

TGACTTAAGCCACAAAGCAGTCATTGATCACTGCATCATTTACCTGTTGACGGGCGAATT 

ATACAACAACGTACTAACATTTGGCTATAAAATAGCTAGAAATGAAGATGTCAACAATAG 

T CTTTTTTG CCATTCTG CAAATGTTAACGTTACGTT ACTGAAAGGCG CTGCTTGGAAAAT 

GTTCCACAGTTTGGTCGGTACATACGCATTCGTTGATTTATTGATCAATTATACAGTAAT 

TCAATTTAATGGGCAGTTTTTCACTCAAATCGTGGGTAACAGATGTAACGAACCTCATCT 

GCCGCCCAAATGGGTCCAACGATCATCCTCATCATCCGCAACTGCTGCGCAAATCAAACA 

ACTTACAGAACCAGTGACAAATAAACAATTCTTACACAAGCTCAATATAAATTCCTCTTC 

TTTTTTTCCTTATAGCAAGATCCTTCCTTCATCATCATCTATCAAAAAGCTAACTGACTT 

GAGAGAAGCTATTTTTCCCACAAATTTGGTTAAAATTCCTCAGAGACTAAAGGTACGAAT 

TAATTTGACGCTGCAAAAGCTATTAAAGAGACATAAGCGTTTGAATTACGTTTCTATTTT 

GAATAGTATTTGCCCACCATTGGAAGGGACCGTATTGGACTTGTCGCATTTGAGTAGGCA 

ATCACCAAAGGAACGAGTCTTGAAATTTATCATTGTTATTTTACAGAAGTTATTACCCCA 

AGAAATGTTTGGCTCAAAGAAAAATAAAGGAAAAATTATCAAGAATCTAAATCTTTTATT 

AAGTTTACCCTTAAATGGCTATTTACCATTTGATAGTTTGTTGAAAAAGTTAAGATTAAA 

GGATTTTCGGTGGTTGTTCATTTCTGATATTTGGTTCACCAAGCACAATTTTGAAAACTT 

GAATCAATTGGCGATTTGTTTCATTTCCTGGCTATTTAGACAACTAATTCCCAAAATTAT 

ACAGACTTTTTTTTACTGCACCGAAATATCTTCTACAGTGACAATTGTTTACTTTAGACA 

TGATACTTGGAATAAACTTATCACCCCTTTTATCGTAGAATATTTTAAGACGTACTTAGT 

CGAAAACAACGTATGTAGAAACCATAATAGTTACACGTTGTCCAATTTCAATCATAGCAA 

AATG AGG ATT AT ACC AAAAAAAAGTAATAATG AGTT C AGG ATT ATTG C C AT CC CATG CAG 

AGGGGCAGACGAAGAAGAATTCACAATTTATAAGGAGAATCACAAAAATGCTATCCAGCC 

CACTCAAAAAATTTTAGAATACCTAAGAAACAAAAGGCCGACTAGTTTTACTAAAATATA 

TTCTCCAACGCAAATAGCTGACCGTATCAAAGAATTTAAGCAGAGACTTTTAAAGAAATT 

TAATAATGTCTTACCAGAGCTTTATTTCATGAAATTTGATGTCAAATCTTGCTATGATTC 

CATACCAAGGATGGAATGTATGAGGATACTCAAGGATGCGCTAAAAAATGAAAATGGGTT 

TTTCGTTAGATCTCAATATTTCTTCAATACCAATACAGGTGTATTGAAGTTATTTAATGT 

TGTTAACGCTAGCAGAGTACCAAAACCTTATGAGCTATACATAGATAATGTGAGGACGGT 

TCATTTATCAAATCAGGATGTTATAAACGTTGTAGAGATGGAAATATTTAAAACAGCTTT 

GTGGGTTGAAGATAAGTGCTACATTAGAGAAGATGGTCTTTTTCAGGGCTCTAGTTTATC 

TGCTCCGATCGTTGATTTGGTGTATGACGATCTTCTGGAGTTTTATAGCGAGTTTAAAGC 

CAGTCCTAGCCAGGACACATTAATTTTAAAACTGGCTGACGATTTCCTTATAATATCAAC 

AGACCAACAGCAAGTGATCAATATCAAAAAGCTTGCCATGGGCGGATTTCAAAAATATAA 

TGCGAAAGCCAATAGAGACAAAATTTTAGCCGTAAGCTCCCAATCAGATGATGATACGGT 

TATTCAATTTTGTGCAATGCACATATTTGTTAAAGAATTGGAAGTTTGGAAACATTCAAG 

CACAATGAATAATTTCCATATCCGTTCGAAATCTAGTAAAGGGATATTTCGAAGTTTAAT 

AGCGCTGTTTAACACTAGAATCTCTTATAAAACAATTGACACAAATTTAAATTCAACAAA 

CACCGTTCTCATGCAAATTGATCATGTTGTAAAGAACATTTCGGAATGTTATAAATCTGC 

TTTTAAGGATCTATCAATTAATGTTACGCAAAATATGCAATTTCATTCGTTCTTACAACG 

CATCATTGAAATGACAGTCAGCGGTTGTCCAATTACGAAATGTGATCCTTTAATCGAGTA 

TGAGGTACGATTCACCATATTGAATGGATTTTTGGAAAGCCTATCTTCAAACACATCAAA 

ATTTAAAGATAATATCATTCTTTTGAGAAAGGAAATTCAACACTTGCAAGC 



# 

Figure 49 



AKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKR 

VQLRDVSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREK.R 

AERLTSRVKALFSVLNYERA 



Figure 50 



GCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAG CTGCTC AGGTC 

TTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCi 1 1 i lCTACC 

GGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAA 

GAGGGTGCAGCTGCGGGACGTGTCGGAAGCAGAGGTCAGGCAGCATCGGGA 

AGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGACG 

GGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCG 

CAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGC 

GTGCTCAACTACGAGCGGGCGCG 



Figure 5 1 



MTEHHTPKSRJLRFLENQYVYLCTLNDYVQLVLRGSPASSYSNICERLRSDVQTSFSIFLHSTVVGF 
DSKPDEGVQFSSPKCSQSEL1ANVVKQMFDESFERRRNLLMKGFSMNHEDFRAMHVNGVQNDLV 
STFPNYLISILESKNWQLLLEIIGSDAMHYLLSKGSIFEALPNDNYLQISGIPLFKNNVFEETVSKKRK 
RTIETSITQNKSARKEVSWNSISISRFSIFYRSSYKKFKQD^ 

LfNAFQVKQLHKVlPLVSQSTVVPKRLLKVYPLIEQTAKRLHRJSLSKVYNHYCPYIDTHDDEKILS 

YSLKPNQVFAFLRSILVRVFPKLIWGNQRIFEIILKDLETFLiCLSRYESFSLHYLMSNIKISEIEWLVL 

GKJISNAK^CLSDFEKJIKQIFAEFIYWLYNSFIIPILQSFFYITESSDLRNRTVYFRK 

SMKMEAFEKINE>OWRJViDTQKTT^ 

LRPVASILKitLINEESSGIPFNLEVYMKLLTFKKDLL^ 

IVKKJCLKDPEFVIRKYATIHATSDRATKNFVSEAFSYFDMVPFEKVVQLLSMKTSDTLFVDFVDY 
WTKSSSEIFKML1CEHLSGHIVKIGNSQYLQKVGIPQGS1LSSFLCHFYMEDLIDEYLSFTKKKGSVL 
LRVVDDFLFITWKJCDAKXFLNLSLRGF 

FSVNMRSLDTLLACPKIDEALFNSTSVELTKHMGKSFFYKILRSSLASFAQVFIDITHNSKFNSCCNI 

YRLGYSMCMRAQAYLKJmKD[FIPQRJv!FITDLLNVlGRKIWKKLAEILGYTSRRFLSSAEVKW 

LGMRDGLKPSFKYHPCFEQLIYQFQSLTDLIKPLRPVLRQVLFLHRRJAD 



Figure 52 



ggtaccgatnacmccructtcaiaagctaattgcucctcgaacgctcctaaatctctggaaatamnacaagaactcaataacaataccaagtcaaanccaatatgaagg 
tgttanagtgatcgataatamctattttatcggtcgnaccaagutaaggacaaaaagaa 

ggncgcnacrmaatcgtggtactgttnagctgctacnctagccaaccgcgtgmctaccccgtcattggatatagctcnggagtagctcacagaaatccnacaaatcn 
ctgatgagactaianagancattacagtccgtgcatattcttaacaiggagccttacactttagatgagtcacgtcgcatgatggagtatnggtatcatccaacgtttgccng 
aaaaggngataanarngcaaaatcatgtccnagtggtggmtccgcgaaagtttmgatgcngcacacgtctagcatgangagatartcaaaaamctatccactac 
ctccmaacgcggTrnarrmctatmctattctcatgngttccaaatatgtatcafc^ 

aataatctaaanagmcgcnataangatagtagtagaaagattggtganctactcgtgtaatg^anagmaaagatacmgcaaaacatnattagctatcattatataaaa 
aaaaicctataanataaaLactaatcaatamgcggtcactatttatttaaaacgnatgatcagt^^ 

GACCGAACACCATACCCCCAAAAGCAGGATTCTTCGCTTTCTAGAGAATCAATATGTATACCTATGTA 
CCTTAAATGATTATGTACAACTTfeTTTTGAGAGGGTCGCCGGCAAGCTCGTATAGCAATATATGCGAA 
CGCTTGAGAAGCGATGTACAAACGTCCTTTTCTAi 1 1 1 1 CTTCATTCGACTGTAGTCGGCTTCG ACAGT 
AAGCCAGATGAAGGTGTTCAATTTTCTTCTCCAAAATGCTCACAGTC 

ggatagctaatatatgggcagCTAATAGCGAATGTTGTAAAACAGATGTTCGATGAAAGTTTTGAGCGTCGAAGGA 

ATCTACTGATGAAAGGG l I I l CCATGgtaaggtanctaattgtgaaatatnacctgcaanactgrncaaagagangtatttaaccgataaagAA 

TCATGAAGATTTTCGAGCCATGCATGTAAACGGAGTACAAAATGATCTCGTTTCTACTTTTCCTAATTA 

CCTTATATCTATACTTGAGTCAAAAAATTGGCAAC III I GTTAGAAATgtaaataccggttaagatgttgcgcactngaaca 

agactgacaagiatagTATCGGCAGTGATGCCATGCATTACTTATTATCCAAAGGAAGTA I il l IGAGGCTCTTC 

CAAATGACAATTACCTTCAGATTTCTGGCATACCACTTTTTAAAAATAATGTGTTTGAGGAAACTGTGT 

CAAAAAAAAGAAAGCGAACCATTGAAACATCCATTACTCAAAATAAAAGCGCCCGCAAAGAAGTTTC 

CTGGAATAGCATTTCAATTAGTAGGTTTAGCATTTTTTACAGGTCATCCTATAAGAAGTTTAAGCAAGgt 

aactaatactgnatccncataactaarmagATCTATATTTTAACTTACACTCTATTTGTGATCGGAACACAGTACACATG 

TGGCTTCAATGGATTTTTCCAAGGCAATTTGGACTTATAAACGCATTTCAAGTGAAGCAATT^ 

AGTGATTCCACTGGTATCACAGAGTACAGTTGTGCCCAAACGTCTCCTAAAGGTATACCCTTTAATTGA 

ACAAACAGCAAAGCGACTCCATCGTATTTCTCTATCAAAAGTTTACAACCATTATTGCCCATATATTGA 

CACCCACGATGATGAAAAAATCCTTAGTTATTCCTTAAAGCCGAACCAGGTGTTTGCGTTTCTTCGATC 

CATTCTTGTTCGAGTGTTTCCTAAATTAATCTGGGGTAACCAAAGGATATTTGAGATAATATTAAAAGg 

tattgtataaaamanaccactaacgattttaccagACCTCGAAA 

ATTATTTAATGAGTAACATAAAGgtaatatgccaaatrttrrtaccanaattaacaaicagATTTCAGAAATTGAATGG 

CCTTGGAAAAAGGTCAAATGCGAAAATGTGCTTAAGTGATTTTGAGAAACGCAAGCAAATATTTGCGG 

A A TTC ATCT A CTG GCT A T AC A A TTCGTTTA TAATA CCT ATTTTAC A ATCI" I "rriri" I ATATCACTGAATC 

AAGTGATTTACGAAATCGAACTGTTTATTTTAGAAAAGATATTTGGAAACTCTTGTGCCGACC^ 

TACATCAATGAAAATGGAAGCGTTTGAAAAAATAAACGAGgrannaaagtattttngcaaaaagctaatacmcagAACAA 

TGTTAGGATGGATACTCAGAAAACTACTTTGCCTCCAGCAGTTATTCGTCTATTACCTAAGAAGAATAC 

CTTTCGTCTCATTACGAATTTAAGAAAAAGATTCTTAATAAAGgtattaactmggtcatcaatgtacma^ 

cagATGGGTTCAAACAAAAAAATGTTAGTCAGTACGAACCAAACTTTACGACCTGTGGCATCGATACTG 

AAACATTTAATCAATGAAGAAAGTAGTGGTATTCCATTTAACTTGGAGGTTTACATGAAGCTTCTTACT 

TTTAAGAAGGATCTTCTTAAGCACCGAATGTTTGGgtaattatataatgcgcgancctcananaattngcagGCGTAAGAAG 

TATTTTGTACGGATAGATATAAAATCCTGTTATGATCGAATAAAGCAAGATTTGATGTTTCGGATTGTT 

AAAAAGAAACTCAAGGATCCCGAATTTGTAATTCGAAAGTATGCAACCATACATGCAACAAGTGACCG 

AGCTACAAAAAACTTTGTTAGTGAGGCG i i l i CCTATTgtaagtnattmtcanggaatrtrnaacaaancntmagTTGATAT 

GGTGCCTTTTGAAAAAGTCGTGCAGTTACTTTCTATGAAAACATCAGATACTTTGTTTGTTGA 

GGATTATTGGACCAAAAGTTCTTCTGAAATTTTTAAAATGCTCAAGGAACATCTCTCTGGACACATTGT 

TAAGgtataccaattgngaangtaataacactaatgaaactagATAGGAAATTCTCAATACCTTCAAAAAGTTGGTATCCCTC 

AGGGCTCAATTCTGTCATC I I 1 11 1 GTGTC ATTTCTATATG G A A G ATTTG ATTG ATG A A TA CCTATCGTT 

TACGAAAAAGAAAGGATCAGTGTTGTTACGAGTAGTCGACGATTTCCTCTTTATAACAGTTAATAAAA 

AGGATGCA AAAAAA i l l i l GAATTTATCTTTAAGAGgtgagttgctgtcancciaagttctaaccgttgaagGATTTGAGAA 

ACACAATTTTTCTACGAGCCTGGAGAAAACAGTAATAAACTTTGAAAATAGTAATGGGATAATAAACA 

ATAC 11 1 1 I [ T A ATG A A AGC A AG A A A AG A ATGCC ATTCTTCGGTTTCTCTGTGAAC ATG AGGTCTCTTG 

ATACATTGTTAGCATGTCCTAAAATTGATGAAGCCTTATTTAACTCTACATCTGTAGAGCTGACGAAAC 

ATATGGGGAAATCIT1 I 1 I 1 i ACAAAA 1 l C 1 AAGgtatactgtgtaactgaataatagctgacaaataatcagA TCGAGCCTTGC 

ATCCTTTGCACAAGTATTTATTGACATTACCCACAA7TCAAAATTCAATTCTTGCTGCAATATATATAG 

GCTAGGATACTCTATGTGTATGAGAGCACAAGCATACTTAAAAAGGATGAAGGATATATTTATTCCCC 

AAAGAATGTTCATAACGGgtgagtacttannaactagaaaagtcattaattaaccttagATCi I 1 I GAATGTTATTGGAAGAAAA 

ATTTGGAAAAAGTTGGCCGAAATATTAGGATATACGAGTAGGCGTTTCTTGTCCTCTGCAGAAGTCAA 



Figure 52 (cont.) 



ATGgtacgtgtcggtctcgagacttcagcaatattgacacatcagGCTTTTTTGTCTTGGAATGAGAGA TGGTTTG A A ACCCTCTT 

TCAAATATCATCCATGCTTCGAACAGCTAATATACCAATTTCAGTCATTGACTGATCTTATCAAGCCGC 

TAAGACCAGTTTTGCGACAGGTGTTATTTTTACATAGAA 

canactggtgtcnaaacaatananactaagtatagctgacccccaaagcaagcatacutaggatnctagtaaagtaaaanaacctcgnanagtmgangacngtctn 
atccnatactmaagaaagangacagtggngctgactactgcccacatgcccanaaacgggagt^ 

aataaggaaagtggttxtctataatgaataatgcccgcactaatgcaaaaagacgaagattatcnctaaacaaggggganaagcaiatccgaaggaaaagagagtaatat 
acccagtgugngaagaaagcaaggataatnggaacaagcnctgcagatgacaggciaaattnggtgaccgaattttggtaaaagccccaggnatccatggiggccg 
gccttgctactgagacgaaaagaaactaaggatagmgaatactaatagctcattmtgtcrtatataaggtmgt^^ 

gtgnaagccanangganccgaaaLagccaaantcnggucctcaaagcggaagtctaaagaacttangaagcttatgaggcncaaaaactcctcctgantaaaggag 
gaatcttccaccgatgaggaaatggatagcnatcagctgctgaggagaagcctaattmtgcaaaaaagaaaatatcangggagacatctcngatgaatcagatgcgga 
gagtatctccagcggatccngatgtcaataacnctamctgaaatgtatggtcctactgtcgcttcgacncicgcagctctacgcagttaagtgaccaaaggtacc 



Figure 53 
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EST2 pep 

Euplotes pep 

Trans of cetirahyrnen 

Consensus 



FFYCTEISST VTXVYr RHDT WN KLIT V FIVE YFK-TYLVEN 40 

FFYVTEQQKS YSKTYYYBKN IWDVI-MKMS IAD LKK ETLA--EVQE 43 

KHKE GSQIFYYRKP IWKLVSfCLTI VKVRXQFSEK MKQMKNNFYQ 44 

FFY.TE..X. .S..YYYKK. IW...-KL — F..K V. . 50 



EST2 pep 

Euplotes pep 

Trans of cecrahymen 

Consensus 



NVCRNHNSY- TLSNF^^t^ISKK^NNE pWlklPCRG 

KEVEEWKKSL GFAPGK [if RLlS^IT-- FR F I "ITFNKK 

KIQLEEEKLE KVEEKLI PED SrQKYPQG^SRI^P^^S-- F I ^TFLRK 

:<...£ . . . . f . .G^ : ?gifnf^'^tq. . 



lE3tttEMTF . RX 



79 
78 
32 

100 



EST2 pep 
lEuploces pep 
4Trans of cetrahymen 

^Consensus 



ADEE3FTIYX ENHKNAIQP^QKILKt 3NK 

IVNSDRXTTX LTTNTKLLN^i^HI^MLKI i^N- 

DKQKNIK U^3U^Ir>OS^§LVFR^£KI^- 

K..K LN.N. .L. .S^QL.L. J£4d5- 



RPTSrTKIYS FTQXADRIKE 
--RMFK -~?r GFAVFN 
— ML-G -QXIGYSVFD 



■. -IG. .VF. 



129 
120 
130 

150 



*EST2 pep 
^Euplotes pep 
fiTrans of cetrahymen 

^Consensus 



FKQRLU&tN NVL -JScE 

YD-DVKFK YE EFVCKWKQVG cl^FF 
NK-QISE8?A QFIEKWKNKG p£:g*YVTL- 



r . . KWK. .G 



iTFMKFD VKSCYD 
ATMD ZEKCYD 



3! 

K&T.T.D 



-CYD 



157 
155 
158 

186 
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II 



S- 1 : FFY VTE TTF QKN RLF FYR KSV WSK 

S-2: RC)H LKR VQL RDV SEA EVR QHR EA 

S-3: ART FRR EKR AER LTS RVK ALF SVL NYE 

A-l: AKF LHW LMS VYV VEL LRS FFY VTE TTF Q . 
A-2: LFF YRK SVW SKL QSI GIR QHL KRV QLR DVS 
A-3: PAL LTS RLR FTP KPD GLR PIV NMD YW 



# 
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A 



Vector Genomic DNA insert Vector 



AS 

B2 «■■— 

5825 bp Sequenced 
-2 kb Hind 111 Fragment 




B 



tezl* 



RT Motits 1 2 3(A) 4(8*) 5(C) 6(D) 



Introns 




Hind ill 



500 bp 
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Poly 4 

t t c 

t a a g c c teg 
5'- cag acc aaa gga att cca taa gg -3' 
QTKGIPQG 



4(B f ) 



5(C) 

D D Y L L I T 

3'- ctg ctg acg gag gag cag egg -5' 

a a aaaaa a a 

c t t t 

c c 
Polv 1 



Figure 57 




Motif B' (4) 

QTKGIPQG 



Motif C (5) 

DDYLLIT 



Figure 58 



# 



PCR Product M2 showed Reasonable Match 
with Other Teiomerase Proteins 



Ot I/CVS YX L S SFYYANLEENAL Q FIiRKESMD P EKP ETNLLMRIiT 

Ea_pl23 KGIPQGIiCVSSILSSFYYATIiEESSIiGFIiKDESMNPENPNVNLLMRLTDDYLLIT 

Sp_M2 SILSSFLCHFYMEDLIDEYLSFTKKK GSVLLRW 

Sc_pl03 DGLFQGSSLSAPXVDLVYDDLI-EFYSEFKASPS QDTLILKLADDFLIIS 



QKVGIPQG 

caa aaa get ggz acc ccc cag gg < Actual Genomic Sequence. 

PolvW4 

' : ^e c c 



c a 



^a g c c ccg 



cag Jtdc aaa gga acc cca caa gg > 

ag a^cic aaa gga acc cca cca ggC TCA ATT CTG TCA TCT TTT TTG TGT CAT TTC TAT ATG 

cc egg cct ccc caa ggc age ccG AGT TAA GAC AGT AGA AAA AAC ACA GTA AAG ATA TAC 

H n KG I PSGSILSSFLCHFYM 



GAA GA|T TTG ATT GAT GAA TAC CTA TCG TTT ACG AAA AAG AAA GGA. TCA GTG TTG TTA CGA 
CTT CTA AAC TAA CTA CTT ATG GAT AGC AAA TGC TTT TTC TTT CCT AGT CAC AAC AAT GCT 

E3LIDEYLSFTKKKGSVLLR 



GTA GTC gac gac cac ccc ccc acc acc 
CAT CAG ccg ccg a eg gag gag cag egg 

VVDDYLLTT 

< ccg ccg acg gag gag cag egg 

a a aaaaa a a 
e e c c 
c c 
Polv 1 



gac gac ccc ccc ccc aca aca 
D D F L F I T 



< Actual Genomic Sequence. 



Figure 59 



3' RT PGR Strategy 



rnRNA 



I AAAAAAAAAAAAAAAA 




1 . Synthesis ot cONA with €h; Primer 
rnRNA 



5* 
3' 



■ AAAAAAAAAAAAAAAA . 

■ rrrr wtimm 1 1 1 1 m 



15* 



2?f \rst Round PGR Using Outside Primer and Qq Pnmer. 



| ■ ri , rrrJ ; . 1 - rl . i . l l ; T 



3." Second Round PCR Using Jnside Pnmer ond Qj Primer, 



4. Sequence Second Round PCR Products Using inside Pnmer or Q{ Primer. 
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-Size Selected Libraries from P. Nurese 
3 - 4 kb 
5 - 6kb 
7 - 8 kb 
1 1 - 12 kb 

-Libraries from J. A. Wise 
Sau 3a Partial Digest 
Hind III Partial Digest 

cONA Libraries 

GAD (Gal Activation Domain) Library 
REP Library from R. AUshire 

;SS , REP81ES Library (old) 

% REP8 1 ES Library (new) 

\j REP41ES Library 



B 



Original PGR 
m RT-PCR 




500 bp 




D 



Mg Cone. 



603- 



c 
o 
a 

d> 

<D 
C 



a 
a 

GO 



-a 
c 

X 



< 

Q 

a 

E 
o 
c 
a> 
0 




Hind III Digested Positive Genomic Clones 



Figure 62 



5' RT PCR Strategy 



ddA- 



mRNA 



I AAAAAAAAAAAAAAAA . 



1 . Synthesis of cDNA with Specific Downstream Primer. 

mRNA 



5* 

y 



I XAAAAAAAAAAAAAAA . 



'SJUgate Oligo with 5'-P and blocked 3* to cDNA ustng T4 RNA Ugase. 



3- First Round PCR 



W Second Round PCR 
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Alignment of RT Domains from Telomerase Catalytic Subunits. 

Motif 0 

S.D. Tezlp (429) . WLYNSFIIPILQSFFYITESSDLRNRT T v r YFRKDIW . . . (35) . . . 

S.c. Est2p (366). W1FRQLI PKI IQTFFYCTEISSTVT- 1 v'YFRHDTW ...(35)... 

E.a. p!23 (441). WI FEDL.WS L I RC FF YVTEQQKS Y SKTYYYRXN I W ...(35)... 

*• * * * * * * * * 

Motif 1 Motif 2 K 
p hh h K hR h R 

S.D. Tezlp AVIR1LPKK--NTFRLITN-LRKRF ...(61)... 
S.c. Est2p SKMRII PKKSNNEFRIIAIPCRGAD ...(62)... 

E.a. p!23 GKLR1 1 PKK--TTFRPIMTFNKr",IV ...(61)... 

* *- * * * * * 

Motif 3(A) AF 

h hDh GY h 
S.D. Tezlp V K Y FVR ID I KSCYDR Z KQDLMFR I VK ...(89)... 
S.c. Est2p E 1 Y FMKF DVK SC YDS Z P RMECMR ILK ...(75)... 

E.a. pl23 K1FFATMDIEKCYDSVNREKLSTFLK . . . (107) . . . 

* * * * * * 

Motif 4 (B ' ) 

hPQG pP hh h 

S.D. Tezlp YLQ KVG Z PQG S I LS S FLCHF YMEDL I DE YLS F . . . (6) . . . 

S.c. Est2p Y I REDG LFQG S S LS API VDLVYDDLLEF YS EF . . . (8) . . . 

E.a. p!23 YKQTKG I PQGLCVS S I L S S F YYATLE ES S LG F ...(14)... 
***** * * 

Y Motif 5(C) Motif 6(D) 

h r DDhhh Gh h cK h 

S.D. Tezlp VLLP.VTODFLFITVNKKD/JCKFLNLSL . (205) 

S.c. Est2p LILKLADDFLIISTEXSQQVINIKKLAMGGFQrCYI^AKAT'IRDKILAVSSQS . (173) 
E.a. D123 LLMRLTDDYLLITTQElNNAVLFIEKLINVSRENGF?vF>JMKKLQTSFPLS .(209) 
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Figure 65 

Disruption strategy for the putative telomerase genes. 



wild type telomerase gene 



wild type telomerase gene 



1 . Transform with linear fragment 
containing the telomerase gene 
disrupted with a LEU2 or ura4 marker. 



M 



telomerase gene::M 
wild type telomerase gene 



wild type telomerase gene 




2. Assay in selective media. 

3. Sporulate, and grow on 
selective media. 





M 









telomerase gene::M 



Wild Type (X2) 



Senescence? (X2) 



(These cells will show a senescence phenotype 

if the disrupted gene encodes a telomerase subunit.) 
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An Example of Confirmation of tezl disruption By PCR 
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Tezl disruption causes progressive 
shortening of telomeres in S. pombe 
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Figure 68 



1 

met ser val tyr val val glu leu leu 
GCCAAGTTCCTGCACTGGCTG ATG AGT GTG TAC GTC GTC GAG CTG CTC 

10 20 
arg ser phe phe tyr val thr glu thr thr phe gin lys asn arg 
AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC AGG 

30 

leu ohe phe tyr arg lys ser val trp ser lys leu gin ser ile 
CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC ATT 

40 50 
gly ile arg gin his leu lys arg val gin leu arg glu leu ser 
GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG TCG 

60 

qlu ala aiu val arg gin his arg glu ala arg pro ala leu leu 
GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG CTG 



70 



80 



thr ser arg leu arg phe ile pro lys pro asp gly leu arg pro 
ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG CCG 

90 

ile val asn met asp tyr val val gly ala arg thr phe arg arg 
ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC AGA 

100 110 

qlu lys ala glu arg leu thr ser arg val lys ala leu phe 

GAA AAG ARG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG TTC 

120 

ser val leu asn tyr glu arg ala arg arg pro gly leu leu gly 
AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG GGC 



130 



140 



ala ser val leu crly leu asp asp ile his arg ala trp arg thr 
GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC ACC 

150 

phe val leu arg val arg ala gin asp pro pro pro glu leu tyr 
TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG TAC 



160 



170 



phe val lys val asD val thr gly ala tyr asp thr ile pro gin 
TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC CAG 

180 

asp arg leu chr glu val ile ala ser ile ile lys pro gin asn 
GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC 
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190 



200 



thr tyr cys val arg arg tyr ala val val gin lys ala ala met 

ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC ATG 

210 

gly thr ser ala arg pro ser arg ala thr ser tyr val gin cys 

GGC ACG TCC GCA AGG CCT TCA AGA GCC ACG TCC TAC GTC CAG TGC 

220 230 

gin gly ile pro gin gly ser ile leu ser thr leu leu cys ser 

CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC TGC AGC 

240 

leu cys tyr gly asp met glu asn lys leu phe ala gly ile arg 

CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG ATT' CGG 

250 260 

arg asp gly leu leu leu arg leu val asp asp phe leu leu val 

CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG TTG GTG 

270 

thr pro his leu thr his ala lys thr phe leu arg thr leu val 

ACA CCT CAC CTC ACC CAC GCG AAA ACC TTC CTC AGG ACC CTG GTC 

280 290 

arg gly val pro glu tyr gly cys val val asn leu arg lys thr 

CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG AAC TTG CGG AAG ACA 

300 

val val asn phe pro val glu asp glu ala leu gly gly thr ala 

GTG GTG AAC TTC CCT GTA GAA GAC GAG GCC CTG GGT GGC ACG GCT 

310 320 

phe val gin met pro ala his gly leu phe pro trp cys gly leu 

TTT GTT CAG ATG CCG GCC CAC GGC CTA TTC CCC TGG TGC GGC CTG 

330 

leu leu asp thr arg thr leu glu val gin ser asp tyr ser ser 

CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC TAC TCC AGC 

340 350 

tyr ala arg thr ser ile arg ala ser leu thr phe asn arg gly 
TAT GCC CGG ACC TCC ATC AGA GCC AGT CTC ACC TTC AAC CGC GGC 

360 

phe lys ala gly arg asn met arg arg lys leu phe gly val leu 
TTC AAG GCT GGG AGG AAC ATG CGT CGC AAA CTC TTT GGG GTC TTG 

370 380 

arg leu lys cys his ser leu phe leu asp leu gin val asn ser 
CGG CTG AAG TGT CAC AGC CTG TTT CTG GAT TTG CAG GTG AAC AGC 
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390 

leu gin thr val cvs thr asn ile tyr lys ile leu leu leu gin 
CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG CTG CAG 

400 410 

ala tyr arg phe his ala cys val leu gin leu pro phe his gin 
GCG TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT CAT CAG 

420 

gin val trp lys asn pro his phe ser cys ala ser ser leu thr 
CAA GTT TGG AAG AAC CCA CAT TTT TCC TGC GCG TCA TCT CTG ACA 

430 4^0 

arg leu pro leu leu leu his pro glu ser gin glu arg arg asp 
CGG CTC CCT CTG CTA CTC CAT CCT GAA AGC CAA GAA CGC AGG .GAT 

450 

val ala gly aly gin gly arg arg arg pro ser ala leu arg gly 
GTC GCT GGG GGC CAA GGG CGC CGC CGG CCC TCT GCC CTC CGA GGC 

460 470 

ara ala val ala val oro pro ser ile pro ala gin ala asp ser 
CGT GCA GTG GCT GTG CCA CCA AGC ATT CCT GCT CAA GCT GAC TCG 

480 

thr pro cvs his leu arg ala thr pro gly val thr gin asp ser 
ACA CCG TGT CAC CTA CGT GCC ACT CCT GGG GTC ACT CAG GAC AGC 

490 500 

pro aso ala ala glu ser glu ala pro gly asp asp ala asp cys 
CCA GAC GCA GCT GAG TCG GAA GCT CCC GGG GAC GAC GCT GAC TGC 

510 

pro aly gly arg ser gin pro gly thr ala leu arg leu gin asp 
CCT GGA GGC CGC AGC CAA CCC GGC ACT GCC CTC AGA CTT CAA GAC 

520 530 

his pro gly leu met ala thr arg pro gin pro gly arg glu gin 
CAT CCT GGA CTG ATG GCC ACC CGC CCA CAG CCA GGC CGA GAG CAG 

540 

thr pro ala ala leu ser arg arg ala tyr thr ser gin gly gly 
ACA CCA GCA GCC CTG TCA CGC CGG GCT TAT ACG TCC CAG GGA GGG 

550 560 

arg gly gly pro his pro gly leu his arg trp glu ser glu ala 
AGG GGC GGC CCA CAC CCA GGC CTG CAC CGC TGG GAG TCT GAG GCC 

564 
OP 

TGA GTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGC 
CTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCGTTTTCACTTCCCCAC 



Figure 68 (cont^ 



AGGCTGGCGTTCGGTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACT 
CCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCTTCGCCCTGCCTTCC 
TTTGCCTTCCACCCCCACCATTCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTTTGGG 
AATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGG 
GTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTT 
TTT C AGTTTTGG AAAAAAAAAAAAAAAAAAAAAAAAAA 
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Motif -1 

Ep pi 23 ...LVVSLIRCFFYVTEQQKSYSKT... 

Sp Tezl ...FIIPILQSFFYITESSDLRNRT... 

Sc Est2 ...LIPKIIQTFFYCTEISSTVTIV... 

HsTCPl . . . YV VELLRS FF YVTETTFQ KNRL . . . 

consensus FFY TE 

K 

Motif 0 phhh K hRh R 

Ep p 1 23 ...KSLGFAPGKLRLIPKKT— TFRPIMTFNKKIV.. 

Sp Tezl . . . QKTTLPP A VIRLLPKKN--TFRLITNLRKRFL . . 

Sc Est2 ...TLSNFNHSKMRIIPKKSNNEFRIIAJPCRGAD. 

Hs TCP1 ...ARPALLTSRLRFIPBCPD--GLRPIVNMDYVVG 

consensus R PK R I 

AF 

Motif A h hDh GY h 

Ep pi 23 ...PKLFFATMDIEKCYDSVNREKLSTFLK... 

*Sp Tezl ...RKKYFVRIDIKSCYDRIKQDLMFRIVK... 

he Est2 ...PELYFMKFDVKSCYDSIPRMECMRILK... 

|Hs TCP1 ...PELYFVKVDVTGAYDTIPQDRLTEVI A.. .//... 

Jconsensus F D YD 

'Motif B hPQG pS hh 

lEp p 1 23 ...NGKJYKQTKGIPQGLCVSSILSSFYYA... 

!Sp Tezl ...GNSQYLQKVGIPQGSILSSFLCHFYME... 

Sc Est2 ...EDKCYIREDGLFQGSSLSAPIVDLVYD... 

i=Hs TCP 1 ...RATSYVQCQGIPQGSILSTLLCSLCYG... 

.consensus G QG S 



Motif C h FDDhhh 

iEppl23 ...PNVNLLMRLTDD YLLITTQENN. 

SpTezl ...KKGSVLLRVVDDFLFITVNKKJD. 

Sc Est2 ...SQDTLILKLADDFLIISTDQQQ... 

Hs TCP1 ...RRDGLLLRLVDDFLLVTPHLTH.. 
consensus DD L 

Motif D Ghh cK. 

Ep p 1 23 ...NVSRENGFKPNMFCKL... 

Sp Tezl ...LNLSLRGFEKHNFST... 

Sc Est2 ...KKLAMGGFQKYNAKA... 

HsTCPl ...LRTLVRGVPEYGCVV... 
consensus G 
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I GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC 
5 1 CCGCGATGCC GCGCGCTCCC CGCTGCCGAG CCGTGCGCTC CCTGCTGCGC 
1 0 1 AGCCACTACC GCGAGGTGCT GCCGCTGGCC ACGTTCGTGC GGCGCCTGGG 
1 5 1 GCCCCAGGGC TGGCGGCTGG TGCAGCGCGG GGACCCGGCG GCTTTCCGCG 
201 CGNTGGTGGC CCANTGCNTG GTGTGCGTGC CCTGGGANGN ANGGCNGCCC 
25 1 CCCGCCGCCC CCTCCTTCCG CCAGGTGTCC TGCCTGAANG ANCTGGTGGC 
301 CCGAGTGCTG CANANGCTGT GCGANCGCGG CGCGAANAAC GTGCTGGCCT 
351 TCGGCTTCGC GCTGCTGG AC GGGGCCCGCG GGGGCCCCCC CG AGGCCTTC 
40 1 ACC ACCAGCG TGCGC AGCTA CCTGCCCAAC ACGGTGACCG ACGC ACTGCG 
45 1 GGGGAGCGGG GCGTGGGGGC TGCTGCTGCG CCGCGTGGGC GACGACGTGC 
501 TGGTTCACCT GCTGGCACGC TGCGCGNTNT TTGTGCTGGT GGNTCCCAGC 
551 TGCGCCTACC ANGTGTGCGG GCCGCCGCTG TACCAGCTCG GCGCTGCNAC 
60 1 TCAGGCCCGG CCCCCGCCAC ACGCTANTGG ACCCGAANGC GTCTGGGATC 
65 1 C AACGGGCCT GGAACCATAG CGTCAGGGAG GCCGGGGTCC CCCTGGGCTG 
701 CCAGCCCCGG GTGCGAGGAG GCGCGGGGGC AGTGCCAGCC GAAGTCTGCC 
751 GTTGCCC AAG AGGCCCAGGC GTGGCGCTGC CCCTGAGCCG GAGCGGACGC 
SO 1 CCGTTGGGCA GGGGTCCTGG GCCCACCCGG GCAGGACGCC TGGACCGAGT 
85 1 GACCGTGGTT TCTGTGTGGT GTCACCTGCC AGACCCGCCG AAG AAGCC AC 
90 1 CTCTTTGG AG GGTGCGCTCT CTGGCACGCG CCACTCCC AC CCATCCGTGG 
95 1 GCCGCCAGCA CCACGCGGGC CCCCCATCCA CATCGCGGCC ACCACGTCCT 
1 00 1 GGG ACACGCC TTGTCCCCCG GTGTACGCCG AGACCAAGCA CTTCCTCTAC 
1 05 1 TCCTCAGGCG ACAAGNACAC TGCGNCCCTC CTTCCTACTC AATATATCTG 
1101 AGGCCCAGCC TGACTGGCGT TCGGGAGGTT CGTGGAGAC A NTCTTTCTGG 
1151 TTCCAGGCCT TGGATGCCAG GATTCCCCGC AGGTTGCCCC GCCTGCCCCA 
1 20 1 GCGNTACTGG CAAATGCGGC CCCTGTTTCT GGAGCTGCTT GGGAACCACG 
1 25 1 CGCAGTGCCC CTACGGGGTG TTCCTC AAGA CGCACTGCCC GCTGCGAGCT 
1301 GCGGTCACCC CAGC AGCCGG TGTCTGTGCC CGGGAGAAGC CCCAGGGCTC 
1351 TGTGGCGGCC CCCGAGGAGG AGGAACACAG ACCCCCGTCG CCTGGTGCAG 
1 40 1 CTGCTCCGCC AGC AC AGC AG CCCCTGGC AG GTGTACGGCT TCGTGCGGGC 
1 45 1 CTGCCTGCGC CGGCTGGTGC CCCCAGGCCT CTGGGGCTCC AGGCACAACG 
1 50 1 AACGCCGCTT CCTCAGGAAC ACCAAGAAGT TCATCTCCCT GGGGAAGCAT 
i 55 1 GCCAAGCTCT CGCTGCAGGA GCTGACGTGG AAGATGAGCG TGCGGGACTG 
! 60 1 CGCTTGGCTG CGCAGGAGCC CAGGGGTTGG CTGTGTTCCG GCCGCAGAGC 
1 65 1 ACCGTCTGCG TGAGGAGATC CTGGCCAAGT TCCTGCACTG GCTGATGAGT 
1 70 1 GTGTACGTCG TCGAGCTGCT CAGGTCTTTC TTTTATGTCA CGGAGACC AC 
1 75 1 GTTTCAAAAG AACAGGCTCT TTTTCTACCG GAAGAGTGTC TGGAGCAAGT 
1 80 1 TGC AAAGCAT TGGAATCAGA CAGC ACTTGA AGAGGGTGCA GCTGCGGGAG 
1 85 1 CTGTCGGAAG CAGAGGTCAG GCAGC ATCGG G AAGCCAGGC CCGCCCTGCT 
1 90 1 G ACGTCC AG A CTCCGCTTCA TCCCCAAGCC TGACGGGCTG CGGCCGATTG 
195 1 TGAACATGGA CTACGTCGTG GGAGCCAGAA CGTTCCGCAG AGAAAAGAGG 
200 1 GCCGAGCGTC TCACCTCGAG GGTGAAGGCA CTGTTCAGCG TGCTCAACTA 
205 1 CGAGCGGGCG CGGCGCCCCG GCCTCCTGGG CGCCTCTGTG CTGGGCCTGG 
2101 ACG ATATCC A C AGGGCCTGG CGC ACCTTCG TGCTGCGTGT GCGGGCCCAG 
2151 GACCCGCCGC CTGAGCTGTA CTTTGTCAAG GTGGATGTGA CGGGCGCGTA 
220 1 CGACACC ATC CCCCAGGACA GGCTCACGGA GGTCATCGCC AGCATCATCA 
225 1 AACCCCAGAA CACGTACTGC GTGCGTCGGT ATGCCGTGGT CCAGAAGGCC 
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2301 GCCCATGGGC ACGTCCGCAA GGCCTTCAAG AGCCACGTCT CTACCTTGAC 
23 5 1 AGACCTCC AG CCGTACATGC G AC AGTTCGT GGCTC ACCTG C AGG AN AAC A 
240 1 GCCCGCTGAG GGATGCCGTC GTCATCGAGC AGAGCTCCTC CCTGAATGAG 
245 1 GCCAGC AGTG GCCTCTTCGA CGTCTTCCTA CGCTTCATGT GCC ACC ACGC 
250 1 CGTGCGCATC AGGGGCAAGT CCTACGTCCA GTGCCAGGGG ATCCCGCAGG 
25 5 1 GCTCCATCCT CTCC ACGCTG CTCTGCAGCC TGTGCTACGG CG ACATGGAG 
260 1 AAC AAGCTGT TTGCGGGGAT TCGGCGGGAC GGGCTGCTCC TGCGTTTGGT 
265 1 GGATGATTTC TTGTTGGTGA CACCTCACCT CACCCACGCG AAAACCTTCC 
270 1 TCAGGACCCT GGTCCGAGGT GTCCCTGAGT ATGGCTGCGT GGTGAACTTG 
275 1 CGGAAGACAG TGGTGAACTT CCCTGTAGAA GACG AGGCCC TGGGTGGC AC 
2801 GGCTTTTGTT CAGATGCCGG CCCACGGCCT ATTCCCCTGG TGCGGCCTGC 
285 1 TGCTGGATAC CCGGACCCTG GAGGTGCAGA GCGACTACTC CAGCTATGCC 
290 1 CGGACCTCCA TC AGAGCCAG TCTCACCTTC A ACCGCGGCT TCAAGGCTGG 
295 1 GAGGAACATG CGTCGCAAAC TCTTTGGGGT CTTGCGGCTG AAGTGTCACA 
300 1 GCCTGTTTCT GGATTTGCAG GTGAACAGCC TCC AGACGGT GTGC ACCAAC 
305 1 ATCTACAAGA TCCTCCTGCT GCAGGCGTAC AGGTTTC ACG CATGT GTGCT 
3101 GCAGCTCCCA TTTCATCAGC AAGTTTGGAA GAACCCC ACA mi iCCTGC 
3151 GCGTCATCTC TGACACGGCC TCCCTCTGCT ACTCCATCCT G AAAGCCAAG 
3201 AACGCAGGGA TGTCGCTGGG GGCCAAGGGC GCCGCCGGCC CTCTGCCCTC 
325 1 CGAGGCCGTG CAGTGGCTGT GCCACCAAGC ATTCCTGCTC AAGCTGACTC 
3301 GACACCGTGT CACCTACGTG CCACTCCTGG GGTCACTCAG GAC AGCCC AG 
3351 ACGCAGCTGA GTCGG AAGCT CCCGGGGACG ACGCTG ACTG CCCTGGAGGC 
340 1 CGCAGCCAAC CCGGCACTGC CCTCAGACTT C AAGACC ATC CTGG ACTG AT 
345 1 GGCCACCCGC CCACAGCCAG GCCGAGAGCA GACACC AGCA GCCCTGTCAC 
3501 GCCGGGCTCT ACGTCCCAGG GAGGGAGGGG CGGCCCACAC CCAGGCCCGC 
3551 ACCGCTGGGA GTCTGAGGCC TGAGTGAGTG TTTGGCCGAG GCCTGCATGT 
360 1 CCGGCTGAAG GCTGAGTGTC CGGCTGAGGC CTGAGCGAGT GTCCAGCC AA 
365 1 GGGCTGAGTG TCCAGCACAC CTGCCGTCTT CACTTCCCCA CAGGCTGGCG 
3701 CTCGGCTCCA CCCCAGGGCC AGCTTTTCCT CACCAGGAGC CCGGCTTCCA 
375 1 CTCCCCACAT AGGAATAGTC CATCCCCAGA TTCGCCATTG TTCACCCCTC 
3801 GCCCTGCCCT CCTTTGCCTT CCACCCCCAC CATCCAGGTG GAGACCCTGA 
385 1 GAAGGACCCT GGGAGCTCTG GGAATTTGGA GTGACCAAAG GTGTGCCCTG 
390 1 TACACAGGCG AGGACCCTGC ACCTGGATGG GGGTCCCTGT GGGTCA AATT 
3951 GGGGGGAGGT GCTGTGGGAG TAAAATACTG AATATATGAG TTTTTCAGTT 
400 1 TTGAAAAAAA AAA AAAAAAA AAAAAAAAA 
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1 

met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10 

pro arg ala pro arg cys arg ala val arg ser leu leu arg ser 
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC 

20 30 
his tyr arg glu val leu pro leu ala thr phe val arg arg leu 
CAC TAG CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG 

40 

gly pro gin gly trp arg leu val gin arg gly asp pro ala ala 
GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT 

50 6C 
phe arg ala leu val ala gin cys leu val cys val pro trp asp 
TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC 

70 

ala arg pro pro pro ala ala pro ser phe arg gin val ser cys 
GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC 

80 90 
leu lys glu leu val ala arg val leu gin arg leu cys glu arg 
CTG AAG GAG CTG GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC 

100 

gly ala lys asn val leu ala phe gly phe ala leu leu asp gly 
GGC GCG AAG AAC GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG 

110 120 
ala arg gly gly pro pro glu ala phe thr thr ser val arg ser 
GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC 



130 

cyr leu pro asn thr val thr asp ala leu arg gly ser glv ala 
TAC CTG CCC AAC ACG GTG ACC GAC GCA CTG CGG GGG AGC GGG GCG 



Figure 74 



140 

trp glv leu leu leu arg arg 

TGG GGG CTG CTG CTG CGC CGC 

"leu leu ala arg cys ala leu 

CTG CTG GCA CGC TGC GCG CTC 

170 

ala tyr gin val cys gly pro 

GCC TAG CAG GTG TGC GGG CCG 



zhr gin ala arg pro pro pro 

ACT CAG GCC CGG CCC CCG CCA 

200 

leu gly cys glu arg ala trp 

CTG GGA TGC GAA CGG GCC TGG 



val pro leu gly leu pro ala 
GTC CCC CTG GGC CTG CCA GCC 

230 

ser ala ser arg ser leu pro 
AGT GCC AGC CGA AGT CTG CCG 



ala ala pro glu pro glu arg 
GCT GCC CCT GAG CCG GAG CGG 

250 

ala his pro gly arg thr arg 
GCC CAC CCG GGC AGG ACG CGT 



val val ser pro ala arg pro 
GTG GTG TCA CCT GCC AGA CCC 



•)_ 



150 

val gly asp asp val leu val his 
GTG GGC GAC GAC GTG CTG GTT CAC 

150 

phe val leu val ala pro ser cys 

TTT GTG CTG GTG GCT CCC AGC TGC 

180 

pro leu tyr gin leu gly ala ala 
CCG CTG TAC CAG CTC GGC GCT GCC 

190 

his ala ser gly pro arg arg arg 
CAC GCT AGT GGA CCC CGA AGG CGT 

210 

asn his ser val arg glu ala gly 
AAC CAT AGC GTC AGG GAG GCC GGG 

220 

pro gly ala arg arg arg gly gly 
CCG GGT GCG AGG AGG CGC GGG GGC 

240 

leu pro lys arg pro arg arg gly 
TTG CCC AAG AGG CCC AGG CGT GGC 

250 

thr pro val gly gin gly ser trp 
ACG CCC GTT GGG CAG GGG TCC TGG 

270 

glv pro ser asp arg gly phe cys 
GGA CCG AGT GAC CGT GGT TTC TGT 

280 

ala glu glu ala thr ser leu glu 
GCC GAA GAA GCC ACC TCT TTG GAG 
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2S0 300 
gly ala leu ser gly chr arg his ser his pro ssr val gly arg 
GGT GCG CTC TCT GGC ACG CGC CAC TCC CAC CCA TCC GTG GGC CGC 

310 

gin his his ala gly pro pre ser thr ser arg pro pro arg pro 
CAG CAC CAC GCG GGC CCC CCA TCC ACA TCG CGG CCA CCA CGT CCC 

320 330 
crp asp chr pro cys pro pro val cyr ala glu chr lys his phe 
TGG GAC ACG CCT TGT CCC CCG GTG TAG GCC GAG ACC AAG CAC TTC 

340 

leu ryr ser ser gly asp lys glu gin leu arg pro ser phe leu 
CTC TAC TCC TCA GGC GAC AAG GAG CAG CTG CGG CCC TCC TTC CTA 

350 360 
leu ser ser leu arg pro ser leu chr gly ala arg arg leu val 
CTC AGC TCT CTG AGG CCC AGC CTG ACT GGC GCT CGG AGG CTC GTG 

3 70 

glu chr lie phe leu gly ser arg pro crp met pre gly chr pro 
GAG ACC ATC TTT CTG GGT TCC AGG CCC TGG ATG CCA GGG ACT CCC 

3S0 390 
arg arc leu pro arg leu pre gin arg cyr crp gin met arg pro 
CGC AGG TTG CCC CGC CTG CCC CAG CGC TAC TGG CAA ATG CGG CCC 

400 

leu phe leu glu leu leu gly asn his ala gin cys pro tyr gly 
CTG TTT CTG GAG CTG CTT GGG AAC CAC GCG CAG TGC CCC TAC GGG 

410 420 
val leu leu lys chr his cys pro leu arg aia aia val chr pro 
GTG CTC CTC AAG ACG CAC TGC CCG CTG CGA GCT GCG GTC ACC CCA 

430 

ala ala gly val cys ala arg glu lys pro gin gly ser val ala 
GCA GCC GGT GTC TGT GCC CGG GAG AAG CCC CAG GGC TCT GTG GCG 
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440 450 

aia pre glu glu glu asp thr asp pro arg arc leu val gin leu 

GCC CCC GAG GAG GAG GAC ACA GAC CCC CG? CGC CTG GTG CAG CTG 

460 

leu arg gin his ser ser pro trp gin val cyr gly one val arg 
CTC CGC CAG CAC AGC AGC CCC TGG CAG GTG TAC GGC TTC GTG CGG 

470 480 
aia cys leu arg arg leu val pro pre gly leu crp gly ser arg 
GCC TGC CTG CGC CGG CTG GTG CCC CCA GGC CTC TGG GGC TCC AGG 

490 

his asn glu arg arg phe leu arg asn thr lys lys phe ile ser 
CAC AAC GAA CGC CGC TTC CTC AGG AAC ACC AAG AAG TTC ATC TCC 

300 510 
leu gly iys his aia lys leu ser leu gin glu leu chr crp lys 
CTG GGG AAG CAT GCC AAG CTC TCG CTG CAC- GAG CTG ACC- TGG AAG 

520 

mec ser val arg asp cys aia trp leu arg arg ser pro gly val 
ATG AGC GTG CGG GAC TGC GCT TGG CTG CGC AGG AGC CCA GGG GTT 

530 540 
gly cys val pro aia aia glu his arg leu arg glu giu ile leu 
GGC TGT GTT CCG GCC GCA GAG CAC CGT CTG CGT GAG GAG ATC CTG 

550 

aia iys phe leu his crp leu met ser val cyr val val giu leu 
GCC AAG TTC CTG CAC TGG CTG ATG AGT GTG TAC GTC GTC GAG CTG 

560 570 
leu arg ser phe phe cyr val thr glu chr chr phe gin lys asn 
CTC AGG TCT TTC TTT TAT GTC ACG GAG ACC ACC- TTT CAA AAG AAC 

580 

arg leu phe phe tyr arg pro ser val trp ser lys leu gin ser 
AGG CTC TTT TTC TAC CGG CCG AGT GTC TGG AGC AAG TTG CAA AGC 

590 500 
lie gly :le arg gin his leu lys arg val gin leu arg glu leu 
ATT GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG 
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610 

ser glu ala glu val arg gin his arg glu ala arg pre ala leu 
TCG GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG 

620 630 
leu thr ser arg leu arg phe ile pro lys pro asp gly leu arg 
CTG ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG 

640 

pro ile val asn met asp tyr val val gly ala arg thr phe arg 
CCG ATT C-TG AAC ATC- GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC 

650 660 
arg glu lys arg ala glu arg leu thr ser arg val lys ala leu 
AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG 

670 

phe ser val leu asn cyr glu arg ala arg arg pro gly leu leu 
TTC AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG 

680 6S0 
gly ala ser val leu gly leu asp asp ile his arg ala trp arg 
GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC 

700 

thr phe val leu arg val arg ala gin asp pro pro pro glu leu 
ACC TTC C-TG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTC- 

710 720 
tyr phe val lys val asp val thr gly ala cyr asp thr ile pro 
TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC 

720 

gin asp arg leu thr glu val ile ala ser ile ile lys pro gin 



740 750 
asn thr tyr cys val arg arg tyr ala val val gin lys ala ala 
AAC ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC 

760 

his gly his val arg lys ala phe lys ser his val ser thr leu 
CAT GGG CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC TCT ACC TTG 



(i 
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770 780 
chr asp leu gin pro cyr raec arg gin phe val ala his leu gin 
ACA GAC CTC CAG CCG TAC ATG CGA CAG TTC GTG GCT CAC CTG CAG 



790 

glu thr ser pro leu arg asp ala val val ile glu gin ser ser 
GAG ACC AGC CCG CTG AGG GAT GCC GTC GTC ATC GAG CAG AGC TCC 



SOO 

ser leu asn glu ala 
TCC CTG AAT GAG GCC 



phe nee cys his his 
TTC ATG TGC CAC CAC 



ser ser gly leu phe 
AGC AGT GGC CTC TTC 

820 

ala val arg ile arg 
GCC GTG CGC ATC AGG 



810 

asp val phe leu arg 
GAC GTC TTC CTA CGC 



gly lys ser tyr val 
GGC AAG TCC TAC GTC 



230 840 
gin cys gin gly ile pro gin gly ser lie leu ser chr leu leu 
CAG TGC CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC 



850 

cys ser leu cys cyr gly asp met glu asn lys leu phe ala gly 
TGC AGC CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG 



360 370 
ile arg arg asp giy leu leu leu arg leu val asp asp phe leu 
ATT CGC- CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG 



leu val chr pro his leu chr his 
TTG GTG ACA CCT CAC CTC ACC CAC 



880 

ala lys chr phe leu arg chr 
GCG AAA ACC TTC CTC AGG ACC 



890 

leu val arg gly val pro glu cyr gly cys val val 
CTG GTC CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG 



900 

asn leu arg 
GTG AAC TTG CGG 



910 

lys chr val val asn phe pro val glu asp glu ala leu gly gly 
AAG ACA GTG GTG AAC TTC CCT GTA GAA GAC GAG GCC CTG GGT GGC 



chr aia phe 
ACG GCT TTT 



920 

val gin 
GTT CAG 



meC 
ATG 



pre ala 
CCG GCC 



his 
CAC 



giy 
'•jGC 



leu phe 
CTA TTC 



930 

pro crp cys 
CCC TGG TGC 
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giy leu leu leu asp 
GGC CTG CTG CTG GAT 

950 

ser ser cyr ala arg 
TCC AGC TAT GCC CGG 



arg gly ?he lys ala 
CGC GGC TTC AAG GCT 



val leu arg leu lys 
GTC TTG CGG CTG AAG 



a sr. ser leu gin chr 
AAC AGC CTC CAC- ACG 

1010 

leu gin ala cyr arg 
CTG CAG GCG 7 AC AGG 



his gin gin val crp 
CAT CAG CAA GTT TGG 

104 0 

ser asp chr ala ser 
TCT C-AC ACG GCC TCC 

ala gly ~eC ser leu 
GCA C-GG ATG TCG CTG 

1070 

ser glu ala val gin 
TCC GAG GCC GTG CAG 



leu chr arg his arg 
CTG ACT CGA CAC CGT 



940 

thr arg chr leu glu 
ACC CGG ACC CTG GAG 



chr ser ile arg ala 
ACC TCC ATC AGA GCC 

970 

gly arg asn mac arg 
GGG AGG AAC ATG CGT 

cys his ser leu phe 
TGT CAC AGC CTG TTT 

1000 

val cys chr asn lie 
GTG TGC ACC AAC ATC 

phe his ala cys val 
TTT CAC GCA TGT GTG 

1030 

lys asn pro chr phe 
AAG AAC CCC AC A TTT 



leu cys cyr ser ile 
CTC TGC TAC TCC ATC 

T r*. r- r\ 

iv'OU 

gly ala lys gly ala 
GGG GCC AAG GGC GCC 

crp leu cys his gin 
TGG CTG TGC CAC CAA 

1090 

val chr cyr val pro 
GTC ACC TAC GTG CCA 



val gin ser asp cyr 
GTG CAG AGC GAC TAC 

960 

ser val chr phe asn 
AGT GTC ACC TTC AAC 



arg lys leu phe gly 

CGC AAA CTC TTT GGG 
990 

leu asp leu gin val 
CTG GAT TTG CAG GTG 



cyr lys ile leu leu 
TAC AAG ATC CTC CTG 

1020 

leu gin leu pro phe 
CTG CAG CTC CCA TTT 



phe leu arg val ile 
TTC CTG CGC GTC ATC 

1050 

leu lys ala lys asn 
CTG AAA GCC AAG AAC 



ala giy pro leu pro 
GCC GGC CCT CTG CCC 

1080 

ala phe leu leu lys 
GCA TTC CTG CTC AAG 

leu leu gly ser leu 
CTC CTG GGG TCA CTC 



Figure 74 (cont.) 



1110 

crin leu ser arg lys leu pro gly thr thr 
CAG CTG AGT CGG AAG CTC CCG GGG ACG ACG 

1120 

aia ala ala asn pre ala leu pre ser asp 
GCC GCA GCC AAC CCG GCA CTG CCC TCA GAC 

1132 
aso 0? 

GAC TGA TGGCCACCCGCCCACAGCCAGGCCGAGAGCAGA 
CACCAGCAGCCCTG7CACGCCGGGCTCTACGTCCCAGGGAGGGAC-GGGCGGCCCACACCC 
AGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCC 
GGCTGAAGGCTGAGTC-TCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTC 
CAGCACACC7GCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAG 
CTTTTCYTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGATT 
CGCCATTGTTCACCCYTCGCCCTGCCYTCCTTTGCCTTCCACCCCCACCATCCAGGTGGA 

G AC CC TG AG AAGGAC C C T GGG AGCTCTGGGAATTTGG AGTG ACC AAAGGTGTGC CCTGT A 
CACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAAT7GGGGGGAGGTGC 
iGTGGGACTAAJ^ATA.CTGAATATATGAGTTTTTCA.GTTTTGRAAAAAAAAAAAAAAA^ 

AAAAAAAAAA 



110G 

arg chr ala gin thr 
AGG ACA GCC CAG ACG 



leu 



hr ala leu glu 
CTG ACT GCC CTG GAG 



113G 

phe lys chr lie leu 
TTC AAG ACC ATC CTG 



Figure 75 
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